Warning message

Andreas Stolcke stolcke at speech.sri.com
Wed Sep 20 21:49:43 PDT 2006


In message <20060919160016.52311.qmail at web36806.mail.mud.yahoo.com>you wrote:
> --0-789738089-1158681616=:50607
> Content-Type: text/plain; charset=iso-8859-1
> Content-Transfer-Encoding: 8bit
> 
> I am developing language models of different order (2 to 5) with Good-Turing 
> discounting and Katz backoff for Smoothing.  I all cases, I have got the foll
> owing warning message:
>     discount coeff 1 is out of range : 6.2135e-17
> 
> I could not get the reason for the warning message. I develop language models
>  5 days ago using the same data and smoothing techniques, but this warning  m
> essage was no there.

Something must have changed.  What was it?  Has the software been updated?

> 
> Could you please tell me the reason behind? Does it affect the quality of my 
> language models?

The warning is issued because discount coefficients (the factors by which
the maximum likelihood estimates are reduced) should be between 0 and 1.
The value you are getting is effectively zero.  It indicates an
anomaly (non-smoothness) in the count-of-count of your data.

--Andreas 




More information about the SRILM-User mailing list