[SRILM User List] ngram pruning
王秋锋
wqfengnlpr at gmail.com
Sat Dec 19 04:19:08 PST 2009
hi all,
I get the original BiGram from the text with ngram-count tool,
like "ngram-count -text corpus -lm Original_BiGram -order 2"
so the original_Bigram is very large, I need pruning, like "ngram -lm Original_BiGram -order 2 -prune... "
But I found that the -prune tool can not prune the UniGram, the -minprune n is at least 2.
So What can I do to prune the Unigram?
because all the words from the corpus are in the Unigram, it is too large, and some words are really useless.
Thanks.
Wang
2009-12-19
王秋锋
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20091219/0798e945/attachment.html>
More information about the SRILM-User
mailing list