Problems finding best path (to choose synonynm)

Ehud Reiter ereiter at csd.abdn.ac.uk
Tue Sep 6 06:07:16 PDT 2005


I'm trying to use srilm for a Natural Language Generation
application, to choose between synonymns of a word.  The input
to a system is a structure such as

  you OR[answered,got] 4 questions OR[correctly,correct,right]

The system needs to make a choice at each OR point, with the
goal of producing the easiest-to-read final sentence.  There are
preference weights for the choices, for example, "answered"
gets a preference weight of 0.2 and "got" gets 0.8, this reflects
the fact that even ignoring LM issues we expect "got"
to be easier to read (shorter, simple phoneme->letter mapping)

I represent the above as a "wlat" format file, which I convert
to pfsg and then run lattice-tool on.  However, I can't get
lattice-tool to find the best path through the mesh taking into
account both the language model and the preference weights.
If I specify -viterbi-decode  I get the best path based on the
LM (but ignoring the preference scores), while if I specify
-posterior-decode I get the best path based on preference scores
(but ignoring the LM).  I'd also like to see the actual scores,
I thought I would get this with -nbest-decode but the nbest file
has 0 for all the scores.

Is there any way to find the best path taking both LM and
preference weights into consideration, and giving actual
scores?

Many thanks
					Ehud Reiter




More information about the SRILM-User mailing list