tolower option

B. Plank bplank at science.uva.nl
Mon Mar 12 10:05:49 PDT 2007


Dear SRILM mailing list,

I am wondering.. when I try to train a language model with ngram-count and
the –tolower option,
I’m getting the following error:

assertion "i < maxWordLength" failed: file "Vocab.cc", line 97

The input corpus (-text) is an utf8 file. Might this cause the problem?

I am grateful for any suggestion.

Barbara






More information about the SRILM-User mailing list