tolower option

B. Plank bplank at
Mon Mar 12 10:05:49 PDT 2007

Dear SRILM mailing list,

When I try to train a language model with ngram-count and the -tolower
option, I get the following error:

assertion "i < maxWordLength" failed: file "", line 97

The input corpus (-text) is a UTF-8 file. Could this be causing the problem?

I would be grateful for any suggestions.

