[SRILM User List] lm interpolation

Mon Oct 29 12:46:33 PDT 2012

On 10/29/2012 9:15 AM, Stefy D. wrote:
> Hello everyone,
>
> I am trying to interpolate 2 language models because I want to do an 
> experiment in domain adaption. Below are the commands that I used. 
> When I try to compute lamda, I get the error "mismatch in number of 
> samples (60001 != 67708)". I don't know what to fix...please help me.
>
> ~/local/tools/srilm/bin/i686/ngram -order 3  -unk -lm 
> ~/local/test1/lm/lm1.lm -ppl 
> ~/local/test1/lm/de-en_corpus1.lowercased.en -debug 2 > ppl1.ppl
> ~/local/tools/srilm/bin/i686/ngram -order 3  -unk -lm 
> ~/local/test2/lm/lm2.lm -ppl 
> ~/local/test2/lm/de-en_corpus2.lowercased.en -debug 2 > ppl2.ppl
> ~/local/tools/srilm/bin/i686/compute-best-mix ~/local/test1/ppl1.ppl 
> ~/local/test2/ppl2.ppl

You need to collect ppl1.ppl and ppl2.ppl on the SAME EXACT DATA. Same 
data, different models.  compute-best-mix will find the optimal 
interpolation to minimize the combined model on that data.

Andreas

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20121029/76d66234/attachment.html>