[SRILM User List] class based model

DUGAST Loic dugast at systran.fr
Mon Jan 6 07:45:35 PST 2014


In the FAQ (http://www.speech.sri.com/projects/srilm/manpages/srilm-faq.7.html), you advise to:

Lower the minimum counts for N-grams included in the LM, i.e., the values of the options -gt2min, -gt3min, -gt4min, etc. The higher order N-grams typically get higher minimum counts.

Do you not mean *raise* the minimum counts (...) instead?

Also, I am not sure I understand why gt2min should be set higher than gt1min, and so on.
Higher-order n-grams are naturally less frequent, so the same cutoff value (gt2min equal to gt1min) will already be harsher on bigrams than on unigrams. Can you explain?
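
To make the point concrete, here is a toy sketch in Python. It is not SRILM code: the corpus and the function names are made up for illustration, and it assumes that a minimum count of k means "keep only n-grams seen at least k times", which is my reading of the -gtNmin options.

```python
from collections import Counter

def ngram_counts(tokens, n):
    """Count every n-gram of order n in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def surviving_fraction(counts, min_count):
    """Fraction of distinct n-grams whose count is at least min_count."""
    kept = sum(1 for c in counts.values() if c >= min_count)
    return kept / len(counts)

# Hypothetical toy corpus, only meant to illustrate the sparsity argument.
tokens = "the cat sat on the mat and the dog sat on the rug".split()

unigrams = ngram_counts(tokens, 1)
bigrams = ngram_counts(tokens, 2)

# Apply the same cutoff (2) to both orders and compare what survives.
uni_kept = surviving_fraction(unigrams, 2)
bi_kept = surviving_fraction(bigrams, 2)

print(f"distinct unigrams kept: {uni_kept:.3f}")
print(f"distinct bigrams kept:  {bi_kept:.3f}")
```

With an identical cutoff, a smaller share of the distinct bigrams survives than of the distinct unigrams, which is why I would expect equal values to already prune the higher orders more heavily.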

Thank you!

