I see one quick way to parallelize ngram-count on a N-core box: -- split file list into N sublists -- launch N ngram-count instances, giving each its own sublist -- merge counts Is there any better way? Cheers, Alexy