[SRILM User List] Data preparation for building language model using ngram-count
Andreas Stolcke
stolcke at speech.sri.com
Fri Jan 15 20:47:38 PST 2010
On 1/15/2010 8:30 PM, Abbas Malik wrote:
> Dear All,
>
> Do we really need to add <s> at the start of each sentence and </s> at
> the end of each sentence for the preparation of a language model using
> ngram-count.
>
> my data looks like:
>
> =============
> <s> sentnce1 </s>
> <s> sentence2 </s>
> so on...
> =============
>
> De we really need <s> and </s> tags?
No. It is done automatically by ngram-count .
Andreas
More information about the SRILM-User
mailing list