parallel ngram-count

Alexy Khrabrov deliverable at
Thu Nov 1 13:56:35 PDT 2007

I see one quick way to parallelize ngram-count on an N-core box:

-- split file list into N sublists
-- launch N ngram-count instances, giving each its own sublist
-- merge counts
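
The three steps above could be sketched roughly as follows. This is only an illustration under stated assumptions: ngram-count is on PATH and reads its corpus from stdin with "-text -", count files hold one "ngram<TAB>count" pair per line, and the helper names (split_round_robin, launch, merge_counts, parallel_count) and the "counts.i" output naming are made up for this example.

```python
import shlex
import subprocess
from collections import Counter


def split_round_robin(files, n):
    """Split a file list into at most n non-empty sublists of near-equal length."""
    return [s for s in (files[i::n] for i in range(n)) if s]


def launch(sublist, out_path, order=3):
    """Start one ngram-count instance on the concatenation of a sublist."""
    # Assumption: "-text -" makes ngram-count read the corpus from stdin.
    cmd = (
        f"cat {shlex.join(sublist)} | "
        f"ngram-count -order {order} -text - -write {shlex.quote(out_path)}"
    )
    return subprocess.Popen(cmd, shell=True)


def merge_counts(paths):
    """Sum per-ngram counts across count files (one 'ngram<TAB>count' per line)."""
    total = Counter()
    for path in paths:
        with open(path, encoding="utf-8") as f:
            for line in f:
                ngram, _, count = line.rstrip("\n").rpartition("\t")
                if ngram:
                    total[ngram] += int(count)
    return total


def parallel_count(files, n, order=3, prefix="counts"):
    """Split, count in n parallel processes, then merge in this process."""
    sublists = split_round_robin(files, n)
    outs = [f"{prefix}.{i}" for i in range(len(sublists))]
    procs = [launch(s, o, order) for s, o in zip(sublists, outs)]
    for p in procs:
        if p.wait() != 0:
            raise RuntimeError("an ngram-count instance failed")
    return merge_counts(outs)
```

Merging in Python keeps every count in memory, so it only suits modest corpora. SRILM also ships an ngram-merge tool that may be the better fit for the last step; as far as I recall it expects sorted count files (ngram-count -sort), so check its man page before relying on that.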

Is there any better way?

