[SRILM User List] WBDiscount backoff weights

Andreas Stolcke stolcke at icsi.berkeley.edu
Tue Aug 7 11:14:55 PDT 2012


On 8/7/2012 10:32 AM, shinichiro.hamada wrote:
> Hi.
>
> I ran the small test described below to understand how SRILM computes
> WBDiscount backoff weights (bow), and I have a question.
>
> The bow values of "<s> context", "context word1", and "context
> word2" (2-grams) are set to zero. Why?
>
> They are the prefixes of "<s> context word1" (or "<s> context word2"),
> "context word1 </s>", and "context word2 </s>" respectively, so I think
> they qualify to have bow values.
>
> I read the explanation of WBDiscount and "Warning5" in the
> ngram-discount manual (*1), but I couldn't find the answer there.
>
> Any advice would help me very much. Thank you.

A backoff weight of zero in the log domain (= 1 in the probability domain)
means that the bigram probs don't need to be rescaled when used for backoff.
This is because, in your example, the probability mass left over from
the explicit trigrams is the same as the probability mass left over from
the corresponding bigrams.   And this, in turn, is because your trigrams

-0.60206	<s> context word1
-0.60206	<s> context word2

have the same probabilities as the corresponding bigrams:

-0.60206    context word1    0
-0.60206    context word2    0

So there is nothing mysterious going on; it just happens to follow from
the bigrams and trigrams in your data.  You are unlikely to find this
situation in realistic data sets.
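
The arithmetic can be checked with a small sketch. It assumes the usual
backoff normalization, bow = (1 - sum of explicit higher-order probs) /
(1 - sum of the corresponding lower-order probs), and takes the log10
values from the lm.arpa listing quoted below. The helper name
backoff_log_weight is made up for illustration and is not part of SRILM.

```python
import math

def backoff_log_weight(higher_logprobs, lower_logprobs):
    """log10 backoff weight for one context (hypothetical helper, not SRILM code).

    higher_logprobs: log10 probs of the explicit N-grams that extend the context.
    lower_logprobs:  log10 probs of the same words under the shortened context.
    """
    left_higher = 1.0 - sum(10.0 ** lp for lp in higher_logprobs)
    left_lower = 1.0 - sum(10.0 ** lp for lp in lower_logprobs)
    return math.log10(left_higher / left_lower)

# Context "<s> context": trigrams "<s> context word1/word2"
# versus bigrams "context word1/word2".
bow_s_context = backoff_log_weight([-0.60206, -0.60206], [-0.60206, -0.60206])

# Context "context word1": trigram "context word1 </s>" versus bigram "word1 </s>".
bow_context_word1 = backoff_log_weight([-0.30103], [-0.30103])

# Context "context" (a unigram): bigrams "context word1/word2"
# versus unigrams "word1" and "word2".
bow_context = backoff_log_weight([-0.60206, -0.60206], [-0.69897, -0.69897])

print(bow_s_context, bow_context_word1, bow_context)
```

The two bigram contexts come out as log weight 0 because numerator and
denominator are identical, while the unigram "context" comes out at
about -0.0792, in line with the -0.07918124 shown in the 1-grams section.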

Andreas




>
> (*1) ngram-discount manual
> http://www-speech.sri.com/projects/srilm/manpages/ngram-discount.7.html
>
>
> ----------------------------------------------------------------------
> $ cat > smp.txt << EOF
> context word1
> context word2
> EOF
> $ ngram-count -order 3 -wbdiscount -text smp.txt -gtmin 0 -gt1min 0 -gt2min 0 \
>     -gt3min 0 -lm lm.arpa
> $ cat lm.arpa
>
> \data\
> ngram 1=5
> ngram 2=5
> ngram 3=4
>
> \1-grams:
> -0.5228788	</s>
> -99	<s>	-0.3222193
> -0.5228788	context	-0.07918124
> -0.69897	word1	-0.146128
> -0.69897	word2	-0.146128
>
> \2-grams:
> -0.1760913	<s> context	0
> -0.60206	context word1	0
> -0.60206	context word2	0
> -0.30103	word1 </s>
> -0.30103	word2 </s>
>
> \3-grams:
> -0.60206	<s> context word1
> -0.60206	<s> context word2
> -0.30103	context word1 </s>
> -0.30103	context word2 </s>
>
> \end\
>
> --
> Shinichiro Hamada
>
> _______________________________________________
> SRILM-User site list
> SRILM-User at speech.sri.com
> http://www.speech.sri.com/mailman/listinfo/srilm-user


