[SRILM User List] Fwd: Fwd: ngram-count

Andreas Stolcke stolcke at speech.sri.com
Thu Jan 14 10:01:03 PST 2010


On 1/14/2010 8:49 AM, Manuel Alves wrote:
>     p( </s> | . ...)     =  0.999997 [ -1.32346e-06 ]

You have a very strange LM: almost all of its probability mass is on 
the end-of-sentence tag.
How many words are in your training corpus?
How many unigrams, bigrams, and trigrams are in your LM?
I suspect some basic problem with the preparation of your training data.
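
As a rough way to answer the two questions above, here is a minimal Python 
sketch. It assumes the LM was written in the standard ARPA text format 
(with a \data\ header listing "ngram N=count" lines) and that the training 
corpus has one sentence per line. The function names and the file names 
train.txt and lm.arpa are hypothetical, chosen only for illustration.

```python
import re


def arpa_ngram_counts(lines):
    """Return {order: count} read from the \\data\\ header of an ARPA LM."""
    counts = {}
    in_data = False
    for line in lines:
        line = line.strip()
        if line == "\\data\\":
            in_data = True
            continue
        if in_data:
            m = re.match(r"ngram (\d+)=(\d+)$", line)
            if m:
                counts[int(m.group(1))] = int(m.group(2))
            elif line.startswith("\\"):
                break  # reached the first n-gram section, e.g. \1-grams:
    return counts


def corpus_stats(lines):
    """Return (sentences, words), counting non-empty lines and whitespace tokens."""
    sentences = 0
    words = 0
    for line in lines:
        tokens = line.split()
        if tokens:
            sentences += 1
            words += len(tokens)
    return sentences, words


# Hypothetical usage (file names are assumptions):
#   with open("train.txt") as f:
#       sentences, words = corpus_stats(f)
#   with open("lm.arpa") as f:
#       print(arpa_ngram_counts(f))
#   print(words / sentences)  # average sentence length in words
```

If the average sentence length comes out close to 1, for example because 
the corpus was written with one word per line, that would be one possible 
explanation for nearly all the mass landing on </s>. This is only a guess 
at the cause, not a diagnosis.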

Andreas


