ngram-class with -incremental + -save-maxclasses
Matt Lease
mlease at cs.brown.edu
Fri Mar 28 12:03:25 PDT 2008
What is the behavior of -save-maxclasses for ngram-class when
-incremental is used? My understanding of -incremental is that C, as
specified by -numclasses, fixes the number of classes for the entire
run (i.e. C+1 while a new word is being merged into the existing C
classes), in which case -save-maxclasses would seem not to add anything
(i.e. perhaps it's only intended for V^3 clustering).
If one wanted to get several different clusterings with the greedy
algorithm without re-running each from scratch, it looks like you can use
the -class-counts option to write out the class counts and then feed that
counts file into a subsequent invocation of ngram-class. For example, run
it initially with C=1000, then feed the output class counts into a second
invocation with C=500, say. Is this the correct procedure?
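To make the question concrete, here is a rough sketch of the two passes I
have in mind. The file names (train.txt, classes.1000, counts.1000, etc.)
are made-up placeholders, and the run wrapper is just a helper of my own
that prints the commands instead of executing them unless DRY_RUN=0.
Whether the second pass is actually a valid use of the class counts is
exactly what I am asking.

```shell
#!/bin/sh
# Sketch only: all file names below are hypothetical placeholders.
# With DRY_RUN=1 (the default) the commands are printed, not executed.
DRY_RUN=${DRY_RUN:-1}
run() {
    if [ "$DRY_RUN" = 1 ]; then echo "$*"; else "$@"; fi
}

# Pass 1: incremental clustering of the training text into 1000 classes,
# writing both the class definitions and the class bigram counts.
run ngram-class -text train.txt -incremental -numclasses 1000 \
    -classes classes.1000 -class-counts counts.1000

# Pass 2 (the step in question): read the class counts from pass 1 as if
# they were word counts and merge the 1000 classes down to 500.
run ngram-class -counts counts.1000 -incremental -numclasses 500 \
    -classes classes.500 -class-counts counts.500
```

If this works as I expect, classes.500 would presumably be defined over the
1000 class names from the first pass rather than over the original words, so
the two class files would have to be composed to get a word-to-class mapping.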
Thanks!