[SRILM User List] class based language model s

Andreas Stolcke stolcke at icsi.berkeley.edu
Tue Dec 3 14:48:57 PST 2013


On 12/3/2013 12:50 PM, Laatar Rim wrote:
> in the class format:
> /class/  [/p/]/word1/  /word2/  ...
> how can i calculate p ?

Use replace-words-with-classes with the outfile=  option.   This is 
explained in a previous post 
<http://www.speech.sri.com/pipermail/srilm-user/2007q2/000445.html>.

Andreas

>
> Le Mardi 3 décembre 2013 18h18, Andreas Stolcke 
> <stolcke at icsi.berkeley.edu> a écrit :
> On 12/3/2013 1:35 AM, Laatar Rim wrote:
>> hello,
>>
>> on the internet I found this:
>>  to build and use a simple class language model:
>> Induce classes:
>> ngram-class -vocab vocab_file \
>>              -text input_file \
>>              -numclasses num \
>>              -class-counts output.class-counts \
>>              -classes output.classes
>> in this exemple we need only number of class, how can i use corpus of class ???
> The steps for building a class-based LM are:
>
> 1. prepare class definition file in the format described in the
>    classes-format(5) manual page.  this can be done by hand or from other
>    knowledge sources, or automatically using word clustering algorithms
>    (see ngram-class(1)).
>
> 2. condition the training data or counts to replace words with class 
> labels,
>    using the "replace-words-with-classes" filter (see training-scripts(1)
>    man page).
>
> 3. run ngram-count on the result of step 2.
>
>
> Andreas
>
>
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20131203/bff1bf2b/attachment.html>


More information about the SRILM-User mailing list