Hi, I was working on the reuters rcv1 corpus and while investigating a discrepancy in the language model output I realized that the ngram command skips lines in the test file that start with '##'. Is this a documented feature or a bug? best, deniz