[SRILM User List] OOV terminology
Joris Pelemans
Joris.Pelemans at esat.kuleuven.be
Wed Jul 3 11:22:01 PDT 2013
Hello all,
My question is perhaps a little bit of topic, but I'm hoping for your
cooperation, since it's LM related.
Say we have a training corpus with lexicon V_train. Since some of the
words have near-zero counts, we choose to exclude them from our LM. This
gives us a new lexicon, let's call it V_final. However this also gives
us two types of OOV words: those not in V_train and those not in
V_final. I was wondering whether there are standard terms in the
literature for these two types of OOVs. I have read my share of papers,
but none of them seem to make this distinction.
Kind regards,
Joris
More information about the SRILM-User
mailing list