[SRILM User List] how are the probabilities computed in ngram-count
Saman Noorzadeh
saman_2004 at yahoo.com
Tue Apr 10 01:29:37 PDT 2012
Hello
I am confused about the models that ngram-count produces:
ngram-count -order 2 -write-vocab vocabulary.voc -text mytext.txt -write model1.bo
ngram-count -order 2 -read model1.bo -lm model2.BO
For example (the text is very large and these words are just a sample):
in model1.bo:
cook 14
cook was 1
in model2.BO:
-1.904738 cook was
My question: I expected the log probability of the bigram 'cook was' to be log10(1/14), since the count of 'cook was' is 1 and the count of 'cook' is 14. Instead, ngram-count reports -1.904738, which is roughly log10(1/80).
How are these probabilities computed?
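
To make the discrepancy concrete, here is a small Python check of the arithmetic. It is only a sketch of the comparison described above: the function names are made up for illustration, and it assumes the expected value is the plain relative frequency count(cook was) / count(cook). It does not reproduce what ngram-count does internally.

```python
import math

def relative_frequency_log10(bigram_count, history_count):
    # Unsmoothed estimate: log10 of count(w1 w2) / count(w1).
    # This is the value the question expects, not necessarily what ngram-count computes.
    return math.log10(bigram_count / history_count)

def implied_denominator(log10_prob):
    # Convert a log10 probability back to the "1 in N" form.
    return 1.0 / (10.0 ** log10_prob)

expected = relative_frequency_log10(1, 14)   # counts taken from model1.bo
implied = implied_denominator(-1.904738)     # value taken from model2.BO

print(f"expected log10 prob: {expected:.4f}")
print(f"reported value corresponds to about 1/{implied:.1f}")
```

The first number is the log10(1/14) figure and the second shows the "1 in N" form of the value written to model2.BO, so the two can be compared directly.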