[SRILM User List] big difference between ppl and ppl1
Burkay Gur
burkay at mit.edu
Tue Dec 27 00:56:32 PST 2011
Is your Dutch model arranged so that there is one sentence on each line? Also which command are you using? I recommend using -gt1max 1 -gt2max 1 -gt3max 1 and -ukndiscount for kneser ney smoothing. These will give you more accurate perplexities.
-Burkay
Sent from my iPad
On Dec 27, 2011, at 6:26 AM, Saman Noorzadeh <saman_2004 at yahoo.com> wrote:
>
> I made 2 models of 2 languages, Dutch and English, to make a language recognition.
> I got the following perplexities:
>
> Model: Dutch Test: English ppl:55 ppl2: 2* 10^18
> Model: Dutch Test: Dutch ppl:303 ppl2: 400
> Model: English Test: Dutch ppl: 600 ppl2: 3122ses n
> Model: English Test: English ppl: 227 ppl2: 1897
>
> I think it is reasonable if I have a large perplexity when my model and test are different but why ppl=55 when having a Duch model and an English test?
> and
> Why is there a BIG difference in their ppl and ppl1 ?
>
> Thanks in advance
>
>
> _______________________________________________
> SRILM-User site list
> SRILM-User at speech.sri.com
> http://www.speech.sri.com/mailman/listinfo/srilm-user
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20111227/e0c542b8/attachment.html>
More information about the SRILM-User
mailing list