[SRILM User List] Class-based probability using -expand-classes

Andreas Stolcke stolcke at icsi.berkeley.edu
Wed May 16 14:33:23 PDT 2018


I'm not sure what "an approximate inverse to the original language 
model" means.

But the purpose of ngram -expand-classes is to approximate the class LM 
probabilities (the equation you give) using only word ngram 
probabilities.  It does so by inserting all expanded word ngrams into 
the LM and giving them probabilities according to

P(w_n | w_n-1 ... w_1) = P(w_n w_n-1 ... w_1) / P(w_n-1 ... w_1)

where the joint probabilities on the right-hand side are computed by the 
class-based LM.
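
As a rough illustration of that ratio, here is a toy sketch in Python. It is not SRILM's code: the class names, words, and probabilities are invented, the model is a plain class bigram with no smoothing or backoff, and the helper names `joint_prob` and `cond_prob` are hypothetical. The joint probability sums over every class sequence that could have produced the words, and the conditional is the ratio of two joints.

```python
from itertools import product

# Toy class-based bigram model. All numbers are invented for illustration.
# P(word | class): class membership probabilities.
word_given_class = {
    "CITY": {"paris": 0.6, "london": 0.4},
    "VERB": {"visit": 1.0},
}
# P(class) for the first position of an n-gram.
class_unigram = {"CITY": 0.5, "VERB": 0.5}
# P(class | previous class).
class_bigram = {
    ("VERB", "CITY"): 0.8,
    ("VERB", "VERB"): 0.2,
    ("CITY", "VERB"): 0.7,
    ("CITY", "CITY"): 0.3,
}


def joint_prob(words):
    """P(w_1 ... w_n) under the toy class LM, summed over class sequences."""
    classes = list(word_given_class)
    total = 0.0
    for class_seq in product(classes, repeat=len(words)):
        p = 1.0
        prev = None
        for word, cls in zip(words, class_seq):
            # Class transition: unigram for the first word, bigram afterwards.
            if prev is None:
                p_class = class_unigram[cls]
            else:
                p_class = class_bigram.get((prev, cls), 0.0)
            # Word emission from the class.
            p *= p_class * word_given_class[cls].get(word, 0.0)
            prev = cls
        total += p
    return total


def cond_prob(words):
    """P(w_n | w_1 ... w_n-1) as a ratio of joint probabilities."""
    return joint_prob(words) / joint_prob(words[:-1])


print(cond_prob(["visit", "paris"]))
```

In this toy model every word belongs to exactly one class, so the ratio reduces to P(paris | CITY) * P(CITY | VERB) = 0.6 * 0.8. When a word belongs to several classes, the sum over class sequences no longer factors that way.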

Andreas

On 5/16/2018 1:35 PM, Jonathan Mendoza wrote:
> SRILM community,
>
> If I build a class based LM via
>
> replace-words-with-classes -> ngram
>
> then re-build the LM using -expand-classes,
>
> will the rebuilt LM follow the class based probabilities,
>
> P(w_n | w_n-1 ... w_1) ?= P(w_n | c_n) * P(c_n | c_n-1 ... c_1).
>
> Or is the mapping an approximate inverse to the original language model?