[SRILM User List] class based language model

Andreas Stolcke stolcke at icsi.berkeley.edu
Wed Jun 6 14:27:08 PDT 2012


On 6/6/2012 3:36 AM, Shammur Absar Chowdhury wrote:
> Thank You sir for your help.
>
> I have actually another very silly question.
> After I get the probability distribution over words, I build another 
> language model, and when I compare it with my previous 
> LM (where I used my class definition with no [p] value), 
> I find no difference.
>
> I might have an understanding problem in the basic theory [as I have just 
> read about it in books], or I might be doing something wrong in these steps.
>
> My recent steps that I am following:
>
> [1] replace-words-with-classes classes=atis_sphinx.def addone=1 
> normalize=1 outfile=countExpansion compound_LM.txt
>
> [2] replace-words-with-classes classes=countExpansion compound_LM.txt 
> > output_text_with_classes
Verify the output of these two steps.   Do the class definitions and 
modified text look okay?
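
One way to check the class definitions written by step [1] is to confirm that the expansion probabilities of each class sum to roughly 1. The following is only a minimal sketch, assuming the file uses the `CLASS [p] word1 word2 ...` layout with one expansion per line. The helper name `check_class_probs` and the 0.001 tolerance are made up for illustration, and the sketch assumes that no expansion begins with a purely numeric word (such a word would be mistaken for a probability).

```shell
# Hypothetical helper (not part of SRILM): print every class whose
# expansion probabilities do not sum to approximately 1.
check_class_probs() {
    awk '
        NF >= 2 {
            # Treat field 2 as a probability only if it looks numeric and
            # at least one word follows; otherwise assume an implicit 1.
            if (NF >= 3 && $2 ~ /^[0-9]*\.?[0-9]+([eE][-+]?[0-9]+)?$/)
                p = $2 + 0
            else
                p = 1
            sum[$1] += p
        }
        END {
            for (c in sum) {
                d = sum[c] - 1
                if (d < 0) d = -d
                # Report classes that deviate by more than the tolerance.
                if (d > 0.001) printf "%s %.4f\n", c, sum[c]
            }
        }
    ' "$1"
}
```

Running `check_class_probs countExpansion` should print nothing if every class is normalized. A class definition without [p] values would typically show up here with a sum equal to its number of expansions.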
>
> [3] ngram-count  -text output_text_with_classes 
> classes=countExpansion   -lm class_based_model_2.lm
>
> also tried  ngram-count  -text output_text_with_classes   -lm 
> class_based_model_2.lm
The second form is correct.   There is no need to specify the class 
definitions with ngram-count.

You should be able to use the final LM with

     ngram -lm class_based_model_2.lm -classes countExpansion (...other options...)

Andreas


