[SRILM User List] lm interpolation
Andreas Stolcke
stolcke at icsi.berkeley.edu
Mon Oct 29 12:46:33 PDT 2012
On 10/29/2012 9:15 AM, Stefy D. wrote:
> Hello everyone,
>
> I am trying to interpolate 2 language models because I want to do an
> experiment in domain adaption. Below are the commands that I used.
> When I try to compute lamda, I get the error "mismatch in number of
> samples (60001 != 67708)". I don't know what to fix...please help me.
>
> ~/local/tools/srilm/bin/i686/ngram -order 3 -unk -lm
> ~/local/test1/lm/lm1.lm -ppl
> ~/local/test1/lm/de-en_corpus1.lowercased.en -debug 2 > ppl1.ppl
> ~/local/tools/srilm/bin/i686/ngram -order 3 -unk -lm
> ~/local/test2/lm/lm2.lm -ppl
> ~/local/test2/lm/de-en_corpus2.lowercased.en -debug 2 > ppl2.ppl
> ~/local/tools/srilm/bin/i686/compute-best-mix ~/local/test1/ppl1.ppl
> ~/local/test2/ppl2.ppl
You need to collect ppl1.ppl and ppl2.ppl on the SAME EXACT DATA. Same
data, different models. compute-best-mix will find the optimal
interpolation to minimize the combined model on that data.
Andreas
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20121029/76d66234/attachment.html>
More information about the SRILM-User
mailing list