[SRILM User List] ngram-count hangs and other problems
E
otheremailid at aol.com
Thu Oct 10 05:37:34 PDT 2013
Thanks!
>1) Make sure you're building 64-bit executables. If "file bin/i686/ngram-count" says that it's a 32-bit executable, do a "make clean" and rebuild with "make MACHINE_TYPE=i686-m64 ...".
This worked, though I had to use "make OPTION=_l". The problem of n-grams with positive log probability is now gone.
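For anyone else checking their build, here is a rough sketch of the 64-bit check described above. The helper names is_64bit_desc and check_binary are made up for illustration and are not part of SRILM. It assumes the "file" utility is installed and that its description of a 64-bit executable contains the text "64-bit".

```shell
#!/bin/sh
# Hypothetical helper: succeeds if a description string (as printed by
# the `file` utility) mentions "64-bit".
is_64bit_desc() {
    case "$1" in
        *64-bit*) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: runs `file -b` (brief mode, no file name in the
# output) on the given path and reports what was found.
check_binary() {
    if is_64bit_desc "$(file -b "$1")"; then
        echo "64-bit"
    else
        echo "not 64-bit"
    fi
}
```

Running check_binary bin/i686/ngram-count would then print "not 64-bit" for a 32-bit build, in which case the "make clean" and rebuild step from the quoted advice applies.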
But when I run the command below:
bin/i686_l/ngram-count -order 1 -vocab wordList -read ngramCounts -lm ug.lm -wbdiscount1
Memory usage is low (~5 MB), but CPU usage is in the high 90s (percent). I tried your suggestion to scale down the data: with just 100 unigrams, the *.lm file was created within minutes.
For the complete data, using -wbdiscount took about 2 hours.
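In case it helps others reproduce the scaled-down run, this is roughly how a subset can be cut from the counts file. It is only a sketch: make_subset is a made-up helper name, and it assumes the counts file holds one n-gram and its count per line, so that taking the first N lines gives N unigrams.

```shell
#!/bin/sh
# Hypothetical helper: copy the first N lines of a counts file to a new file.
# Usage: make_subset INPUT N OUTPUT
make_subset() {
    head -n "$2" "$1" > "$3"
}
```

For example, make_subset ngramCounts 100 ngramCounts.100 writes a 100-line file (the output name is arbitrary), which can then be passed to the same ngram-count command via -read, prefixed with "time" to compare run times as the subset grows.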