[SRILM User List] ngram pruning

王秋锋 wqfengnlpr at gmail.com
Sat Dec 19 04:19:08 PST 2009


hi all,
 I get the original BiGram from the text with ngram-count tool,
like "ngram-count -text  corpus  -lm Original_BiGram  -order 2"
so the original_Bigram is very large, I need pruning, like "ngram -lm Original_BiGram -order 2 -prune... "
But I found that the -prune tool can not prune the UniGram, the -minprune n is at least 2.
So What can I do to prune the Unigram?
because all the words from the corpus are in the Unigram, it is too large, and some words are really useless. 

  Thanks.

Wang

2009-12-19 



王秋锋 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20091219/0798e945/attachment.html>


More information about the SRILM-User mailing list