cahce based models

Andreas Stolcke stolcke at speech.sri.com
Wed Mar 7 17:27:24 PST 2007


In message <3BE78265-2376-4D96-8AB4-547D82E15E92 at gmail.com>you wrote:
> Hi Hani,
> 
> if I'm correctly interpreting your question, the LM subclass CacheLM  
> provides a simple cache component implementation.
> 
> Word probability is boosted if the very same word occured in a window  
> of the last N words (more occurencies yield higher probability). You  
> get ngram to interpolate whatever model you're using with a cache  
> component using -cache. The source code of this one is very  
> straightforward if you're interested in the details.
> 
> If you're looking for the original papers, Kuhn and De Mori published  
> on this in 1990 (as to my knowledge at least).
> 
> Hope this helps.
> 
> Cheers from Aachen,
> 
> Juri

Thanks for this dead-on response!

At risk of stating the obvious, the code for CacheLM is in
$SRILM/lm/src/CacheLM.cc, and is quite short and easy to follow.

Best,

Andreas 

> 
> On 8. Mar, 2007, at 01:17, Hani Safadi wrote:
> 
> > Hi,
> > I would like to get more information on the cache-based models
> > implemented in SRILM. and how to use them.
> > The paper briefly mentions them, and there is no information in the  
> > man pages.
> > Thanks
> > -- 
> > Looking forward to hearing from you.
> > Best wishes,
> > Hani Safadi
> 




More information about the SRILM-User mailing list