FLM

Antoine Ghaoui Antoine.Ghaoui at jinny.ie
Mon Apr 9 03:14:02 PDT 2007


Hello,

I'm using FLM to test some models.

I'm using the same data and the same vocabulary in both tools, ngram- 
count and fngram-count.
I'm not able to generate the same trigram model.
The number of bigram and trigram in the LM files generated are  
different.

using ngram-count, I'm getting:
\data\
ngram 1=315
ngram 2=23800
ngram 3=120408

using fngram-count, I'm getting:
\data\
ngram 0x0=315
ngram 0x1=23523
ngram 0x2=0
ngram 0x3=86366

knowing that ngram-count is used with the default parameters and the  
factor file for the fngram-count is:

##rule trigram
1
U : 2 U(-1) U(-2) ntextfile.flm.cnt ntextfile.flm.lm 3
U1U2	U2	wbdiscount	gtmin 3	interpolate
U1	U1	wbdiscount	gtmin	1	interpolate
0	0

What are the parameters  to use in the factor file in order to get  
the same LM output?


Thanks

Antoine

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20070409/0a9794b8/attachment.html>


More information about the SRILM-User mailing list