SRILM 1.5.6 released
Andreas Stolcke
stolcke at speech.sri.com
Wed Jan 2 09:49:07 PST 2008
Happy New Year!
The latest version of SRILM is available from
http://www.speech.sri.com/projects/srilm/download.html .
This release features much enhanced support for server-based LMs and
improved documentation.
Enjoy,
Andreas
-----------------------------------------------------------------------------
1.5.6 2 January 2008
Functionality:
* New ngram -use-server option to run the client side of a network LM
server as implemented by ngram -server-port. Optionally, probabilities
may be cached in the client (option -cache-served-ngrams).
Mixtures of one or more network and file-based LMs are also possible.
* Likewise, disambig, hidden-gram, and lattice-tool understand the
-use-server option.
* New LMClient class to implement the above (a stub LM subclass that
queries a server for LM probabilities).
* ngram -server-port now behaves like a true server daemon: it handles
multiple simultaneous or sequential clients, and never exits (unless
killed). The number of simultaneous clients may be limited with the
-server-maxclients option.
* Support for 7-zip compressed files (suggested by Alexy Khrabrov).
* lattice-tool -split-multiwords will now print a warning message
about multiwords that were not split because their LM probability was
non-zero.
* LoglinearMix LM class supports n-way mixtures directly, giving more
efficient implementation for n > 2 than recursive object construction
in ngram (contributed by Tanel Alumae).
Bug fixes:
* MultiwordLM now implicitly adds all words to the vocabulary, so that
previously unseen multiwords get split. This has the side effect that
OOVs will appear as zeroprob words.
Documentation:
* The doc/FAQ file has been expanded and reformated as a man page.
It can be viewed with "man srilm-faq" or online at
http://www.speech.sri.com/projects/srilm/manpages/srilm-faq.html .
The major content additions are questions about the build
process, how to build a "Google N-gram LM", smoothing issues,
and OOV-handling (the latter by Deniz Yuret). Corrections and
additions to this document are most welcome!
* A new manual page ngram-discount(7) gives a detailed overview of
smoothing methods found in SRILM (contributed by Deniz Yuret).
* The conversion of man pages to html has been enhanced to better
handle code samples and nested itemized lists.
More information about the SRILM-User
mailing list