cahce based models
Andreas Stolcke
stolcke at speech.sri.com
Wed Mar 7 17:27:24 PST 2007
In message <3BE78265-2376-4D96-8AB4-547D82E15E92 at gmail.com>you wrote:
> Hi Hani,
>
> if I'm correctly interpreting your question, the LM subclass CacheLM
> provides a simple cache component implementation.
>
> Word probability is boosted if the very same word occured in a window
> of the last N words (more occurencies yield higher probability). You
> get ngram to interpolate whatever model you're using with a cache
> component using -cache. The source code of this one is very
> straightforward if you're interested in the details.
>
> If you're looking for the original papers, Kuhn and De Mori published
> on this in 1990 (as to my knowledge at least).
>
> Hope this helps.
>
> Cheers from Aachen,
>
> Juri
Thanks for this dead-on response!
At risk of stating the obvious, the code for CacheLM is in
$SRILM/lm/src/CacheLM.cc, and is quite short and easy to follow.
Best,
Andreas
>
> On 8. Mar, 2007, at 01:17, Hani Safadi wrote:
>
> > Hi,
> > I would like to get more information on the cache-based models
> > implemented in SRILM. and how to use them.
> > The paper briefly mentions them, and there is no information in the
> > man pages.
> > Thanks
> > --
> > Looking forward to hearing from you.
> > Best wishes,
> > Hani Safadi
>
More information about the SRILM-User
mailing list