[SRILM User List] OOV terminology

Joris Pelemans Joris.Pelemans at esat.kuleuven.be
Wed Jul 3 11:22:01 PDT 2013

Hello all,

My question is perhaps a little off topic, but I'm hoping for your 
cooperation, since it's LM-related.

Say we have a training corpus with lexicon V_train. Since some of the 
words have near-zero counts, we choose to exclude them from our LM. This 
gives us a new lexicon, let's call it V_final. However, this also gives 
us two types of OOV words: those not in V_train at all, and those in 
V_train but excluded from V_final. I was wondering whether there are 
standard terms in the literature for these two types of OOVs. I have 
read my share of papers, but none of them seems to make this 
distinction.
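
To make the distinction concrete, here is a minimal Python sketch of the
setup described above. The toy corpus, the min_count threshold, and the
function names are assumptions made up for illustration. The labels
"seen-but-pruned" and "unseen" are placeholders only, not established
terms from the literature.

```python
from collections import Counter

def build_lexicons(train_tokens, min_count=2):
    """Return (v_train, v_final) for a tokenized training corpus.

    v_train: every word type seen in training.
    v_final: word types kept in the LM (count >= min_count, an assumed cutoff).
    """
    counts = Counter(train_tokens)
    v_train = set(counts)
    v_final = {w for w, c in counts.items() if c >= min_count}
    return v_train, v_final

def classify(word, v_train, v_final):
    """Label a word with one of three placeholder categories."""
    if word in v_final:
        return "in-vocabulary"
    if word in v_train:
        # Seen in training, but dropped from the LM lexicon.
        return "seen-but-pruned"
    # Never seen in training at all.
    return "unseen"

# Toy corpus, purely illustrative.
train = "the cat sat on the mat the cat ran".split()
v_train, v_final = build_lexicons(train, min_count=2)
labels = {w: classify(w, v_train, v_final) for w in ["the", "mat", "dog"]}
```

Here "mat" is OOV with respect to V_final only, while "dog" is OOV with
respect to both lexicons.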

Kind regards,


