question about vocabulary

lavecchia Caroline.Lavecchia at loria.fr
Tue May 4 07:18:11 PDT 2004


Hello everybody,

I would like to know whether the SRILM toolkit can generate a vocabulary
limited to, for example, the 20000 most frequent words of a corpus.

I know that the -write-vocab option of ngram-count writes out a
vocabulary, but it always contains every word of the corpus.
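
To make clear what I am after, here is a minimal Python sketch, outside
SRILM, of the result I want. The names top_n_vocab and write_vocab are
made up for illustration, and the sketch assumes a plain-text corpus with
whitespace-separated words.

```python
from collections import Counter
from typing import Iterable, List


def top_n_vocab(lines: Iterable[str], n: int) -> List[str]:
    """Return the n most frequent whitespace-separated tokens."""
    counts = Counter()
    for line in lines:
        # Assumption: the corpus is already tokenized on whitespace.
        counts.update(line.split())
    # most_common orders words by descending count; how ties at the
    # cut-off are broken is not something this sketch tries to control.
    return [word for word, _ in counts.most_common(n)]


def write_vocab(corpus_path: str, vocab_path: str, n: int = 20000) -> None:
    """Write the n most frequent words of corpus_path, one word per line."""
    with open(corpus_path, encoding="utf-8") as corpus:
        vocab = top_n_vocab(corpus, n)
    with open(vocab_path, "w", encoding="utf-8") as out:
        for word in vocab:
            out.write(word + "\n")
```

I assume a file written this way could then be passed to ngram-count
with -vocab, but I would prefer a way to do this within SRILM itself.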

Thanks in advance, and sorry for my bad English,

Caroline L.


