Open-vocabulary LM

Amélie DELTOUR amelie.deltour at ira.uka.de
Tue Feb 25 08:13:00 PST 2003


Hi,
Is it normal that in an open-vocabulary LM (built with the "-unk" 
option) the <unk> token is present as unigram, but not in bigrams and 
trigrams?
(Sorry if this is a silly question, but I am not so familiar with 
language models, and I was told that it would not be the case with other 
toolkits).
Thanks again,

Amélie

-- 
--------------------------------------------------------------------
Amélie DELTOUR
ENSIMAG / Universität Karlsruhe
E-mail : amelie.deltour at ira.uka.de
--------------------------------------------------------------------





More information about the SRILM-User mailing list