[SRILM User List] how are the probabilities computed in ngram-count

Saman Noorzadeh saman_2004 at yahoo.com
Tue Apr 10 01:29:37 PDT 2012


Hello,
I am confused about the models that ngram-count makes:
ngram-count -order 2  -write-vocab vocabulary.voc -text mytext.txt   -write model1.bo
ngram-count -order 2  -read model1.bo -lm model2.BO

For example (the text is very large and these words are just a sample):


in model1.bo:
cook   14 

cook was 1

in model2.BO:
-1.904738  cook was 

My question: the log probability of the bigram 'cook was' should be log10(1/14), which is about -1.1461, but the ngram-count result is -1.9047, which is roughly log10(1/80).
How are these probabilities computed?
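
To make the discrepancy concrete, here is a small sketch of the arithmetic using only the counts and the log value quoted above. The variable names are just illustrative, and the "implied effective count" is only an inference from the numbers (the bigram count that would produce the reported value with the same denominator of 14), not something read from SRILM itself.

```python
import math

# Counts taken from model1.bo in the message above.
count_cook = 14       # unigram count of "cook"
count_cook_was = 1    # bigram count of "cook was"

# Value reported in model2.BO for the bigram "cook was".
reported_log10 = -1.904738

# Unsmoothed relative-frequency (maximum-likelihood) estimate.
mle_prob = count_cook_was / count_cook
mle_log10 = math.log10(mle_prob)

# Probability implied by the reported log10 value.
reported_prob = 10 ** reported_log10

# Effective bigram count that would give the reported probability
# if the denominator were still 14 (an inference, not an SRILM output).
implied_count = reported_prob * count_cook

print(f"MLE:      p = {mle_prob:.6f}, log10 p = {mle_log10:.6f}")
print(f"Reported: p = {reported_prob:.6f}, 1/p = {1 / reported_prob:.1f}")
print(f"Implied effective count = {implied_count:.4f}")
```

The reported value corresponds to an effective count well below 1 for a bigram that was seen once, so the model's estimate is clearly not the raw ratio 1/14.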