[SRILM User List] why some trigrams are lost?
王秋锋
wqfengnlpr at gmail.com
Thu Jul 22 07:49:18 PDT 2010
Hi,all:
I'm trying to get the trigram from the text with the command:
"ngram-count -text char.txt -lm char.tri -order 3"
and the content of char.txt is: a b a b a d a e
and only one trigram in the result file char.tri,
why other trigram are lost? like "b a b" , "b a d" ,...
the content of char.tri is :
\data\
ngram 1=6
ngram 2=7
ngram 3=1
\1-grams:
-0.9208187 </s>
-99 <s> -0.06445797
-0.3767507 a -0.4313637
-0.6575773 b -0.2405493
-0.9208187 d -0.06445797
-0.9208187 e -0.2455126
\2-grams:
-0.30103 <s> a
-0.39794 a b 0
-0.69897 a d
-0.69897 a e
-0.1760913 b a
-0.30103 d a
-0.30103 e </s>
\3-grams:
-0.1760913 a b a
\end\
2010-07-22
王秋锋
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20100722/bf535330/attachment.html>
More information about the SRILM-User
mailing list