[SRILM User List] about FAQ B2

Andreas Stolcke stolcke at speech.sri.com
Tue Dec 29 22:08:37 PST 2009


On 12/29/2009 4:55 AM, 林志誠 wrote:
>
> Hello everyone,
> I have tried to use "ngram-count" to process a file (3-gram),
> but the file is too big for my computer.
> Regarding FAQ B2: does it mean I can divide the file into smaller files and run on those?
>
> If the answer is yes, how do I use the two scripts?
> make-batch-counts
> merge-batch-counts
>
> in
> http://www.speech.sri.com/projects/srilm/manpages/training-scripts.1.html
>
> I can't understand the SYNOPSIS there.
>
> Previously I would have run "ngram-count -text thefile.txt -write output.cnt
> -order 3".
>
> Now I have divided the file into 10 parts and written a list like the one below, named
> thelist.txt:
> thefile1
> thefile2
> thefile3
> ....
> thefile10
>
> How can I use the two scripts to produce a .cnt file just like ngram-count does?
>
Don't just read the synopsis of the man page. Check the paragraphs
describing the make-batch-counts and merge-batch-counts commands in the
DESCRIPTION section. You can also see an example of their use in
$SRILM/test/tests/merge-batch-counts/run-test (after the "split" command).
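
As a rough sketch of what that could look like for the ten-part split
described above (this is not copied from the test script): it follows the
argument order given in the man page, "make-batch-counts file-list
[batch-size [filter [count-dir [options...]]]]" and "merge-batch-counts
count-dir". The batch size of 2, the pass-through filter /bin/cat, and the
directory name "counts" are illustrative assumptions, and the SRILM scripts
are assumed to be on your PATH.

```shell
#!/bin/sh
# Sketch only: assumes thefile1 ... thefile10 exist in the current directory
# and that SRILM's make-batch-counts / merge-batch-counts are on PATH.

# 1. Write the list of input files, one name per line.
: > thelist.txt
i=1
while [ "$i" -le 10 ]; do
    echo "thefile$i" >> thelist.txt
    i=$((i + 1))
done

if command -v make-batch-counts >/dev/null 2>&1 &&
   command -v merge-batch-counts >/dev/null 2>&1; then
    # 2. Count the files in batches, one ngram-count run per batch.
    #    Arguments: file-list, batch-size (files per batch, assumed 2 here),
    #    filter (/bin/cat passes text through unchanged), count-dir,
    #    then options handed on to ngram-count.
    make-batch-counts thelist.txt 2 /bin/cat counts -order 3

    # 3. Merge the per-batch count files in counts/ until one remains.
    merge-batch-counts counts
else
    echo "SRILM scripts not found on PATH; only thelist.txt was written" >&2
fi
```

The single count file left in the counts directory after the merge plays
the role of output.cnt from the one-shot ngram-count run. Pick the batch
size so that one batch fits comfortably in memory; the man page recommends
making it as large as possible without triggering paging.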

Andreas


