Kneser-Ney context counts

Alexy Khrabrov deliverable at gmail.com
Wed Jan 7 16:43:13 PST 2009


On Jan 3, 2009, at 11:14 PM, Andreas Stolcke wrote:

> Alexy Khrabrov wrote:
>> Greetings Andreas -- I'd like to access the number of contexts for  
>> any given ngram, used in Kneser-Ney computation (those with the fat  
>> dot).  What's a good way to get at them via the C++ API?
> You create a ModKneserNey object (Discount.h). Be sure to leave the  
> countsAreModified parameter at the default value (false).
>
> Then invoke ModKneserNey:: estimate() on your counts.  As  a side  
> effect, the lower-order counts will be modified to reflect the  
> context type counts.  Note that the counts of ngrams starting with  
> <s> are unchanged since there are no preceding words for them.

OK.   I am also wondering, whether the number of contexts can be  
reverse-engineered from the kncounts file -- since we have both counts  
and kncounts?

Cheers,
Alexy



More information about the SRILM-User mailing list