Kneser-Ney context counts
Alexy Khrabrov
deliverable at gmail.com
Wed Jan 7 16:43:13 PST 2009
On Jan 3, 2009, at 11:14 PM, Andreas Stolcke wrote:
> Alexy Khrabrov wrote:
>> Greetings Andreas -- I'd like to access the number of contexts for
>> any given ngram, used in Kneser-Ney computation (those with the fat
>> dot). What's a good way to get at them via the C++ API?
> You create a ModKneserNey object (Discount.h). Be sure to leave the
> countsAreModified parameter at the default value (false).
>
> Then invoke ModKneserNey::estimate() on your counts. As a side
> effect, the lower-order counts will be modified to reflect the
> context type counts. Note that the counts of ngrams starting with
> <s> are unchanged since there are no preceding words for them.
OK. I am also wondering whether the number of contexts can be
reverse-engineered from the kncounts file -- since we have both counts
and kncounts?
Cheers,
Alexy
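
For illustration, the "context type count" discussed above (the count written with the fat dot) is the number of distinct words that precede a given ngram. The sketch below derives it from a table of raw higher-order counts. It is a minimal standalone example, not SRILM code: Ngram, CountTable and contextTypeCounts are hypothetical names invented here, and it assumes the table holds each (n+1)-gram exactly once with its raw count.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// Hypothetical types for this sketch (not part of the SRILM API).
using Ngram = std::vector<std::string>;
using CountTable = std::map<Ngram, unsigned long>;

// For every ngram of length `order`, count how many distinct words precede it
// among the (order+1)-grams in `higherOrder` that have a nonzero count.
CountTable contextTypeCounts(const CountTable& higherOrder, std::size_t order) {
    CountTable result;
    for (const auto& entry : higherOrder) {
        const Ngram& ngram = entry.first;
        // Skip ngrams of the wrong length and ngrams that were never observed.
        if (ngram.size() != order + 1 || entry.second == 0) continue;
        // Drop the first word; what remains is the lower-order ngram.
        Ngram suffix(ngram.begin() + 1, ngram.end());
        // Map keys are unique, so each entry is one distinct left context.
        ++result[suffix];
    }
    return result;
}
```

With trigram counts "a b c" = 3, "x b c" = 1 and "a b d" = 2, calling contextTypeCounts(trigrams, 2) gives 2 for "b c" and 1 for "b d". As noted in the quoted reply, ngrams starting with <s> keep their original counts, so a caller would have to copy those over separately; the sketch does not do that.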