[SRILM User List] SRILM bigram VS HTK bigram
Andreas Stolcke
stolcke at icsi.berkeley.edu
Wed Jun 15 11:07:12 PDT 2011
Mehdi hoseini wrote:
> hi
> I made a bigram language model on a small document with both HTK
> toolkit and SRILM toolkit.
> but unfortunately their accuracy in HTK based ASR are so different. I
> mean SRILM bigram has 10 percent accuracy lower than one with HTK.
> Can you help me where is my mistake?
>
> Here is my command for build bigram in SRILM:
> ngram-count -text sentences.txt -order 2 -wbdiscount 1 -wbdiscount
> 2 -lm bigram.txt
> sentences.txt has 405 sentences.
>
> I build my acoustic model based on left to right HMMs with 2 Gaussian
> mixture for triphones using HTK.
> Someone said sentences.txt and i build my model on that.
It is likely that HTK requires some mapping of vocabulary items for
begin/end of sentence.
Also, are you sure that the smoothing method used by the HTK LM tools
are the same as you did with SRILM?
I don't really have experience building LMs for HTK, so you should
inquire on the HTK user forum about this. I know there are plenty of
people using SRILM in conjunction with HTK.
Andreas
>
> Best Regards
>
More information about the SRILM-User
mailing list