[SRILM User List] finding likely substitutes quickly

Deniz Yuret dyuret at ku.edu.tr
Sun Oct 7 01:05:31 PDT 2012


Dear SRILM users,

I have developed an algorithm (FASTSUBS) that can generate the most
likely word substitutes from an n-gram model fast.  We have used
FASTSUBS to achieve state of the art results in unsupervised part of
speech induction in EMNLP 2012.  The paper, the code, and a dataset
with the top 100 substitutes of each token in the WSJ section of the
Penn Treebank are available at http://goo.gl/jzKH0.

best,
deniz


More information about the SRILM-User mailing list