[SRILM User List] Interpolation of Unigrams
Andreas Stolcke
stolcke at icsi.berkeley.edu
Sat Dec 15 23:34:37 PST 2012
On 12/15/2012 6:48 AM, Mohammed Mediani wrote:
> Hi,
> Are the unigrams always interpolated with 0-gram (probability of any
> word from the vocab)?
> I got the same probabilities for unigrams with and without
> -interpolate (both with -kndiscount). Is it meant to be this way?
> Many thanks for your help.
> Mohammed
The KN discounting strategy for unigrams only interpolates with the
zero-gram (uniform) estimate if the -interpolate flag is given.
This is just a special case of the interpolation happening at all
N-vgram levels.
However, there is an independent step whereby unallocated unigram
probability mass is filled in by adding a uniform probability increment
to all words in the vocabulary. When this happens you see a message like
warning: distributing 0.0659302 left-over probability mass over all
26573 words
This happens for unigrams only, and regardless of what discounting
method is in effect, because otherwise that probability mass would be
"lost" and the model would be deficient.
It so happens that the effect of both strategies is the same when it
comes to unigrams, and that explains your observation.
Andreas
More information about the SRILM-User
mailing list