[SRILM User List] WBDiscount backoff weights
Andreas Stolcke
stolcke at icsi.berkeley.edu
Tue Aug 7 11:14:55 PDT 2012
On 8/7/2012 10:32 AM, shinichiro.hamada wrote:
> Hi.
>
> I did a small test described as below to understand SRILM behavior
> of WBDiscount backoff weights (bow), and got a question.
>
> The values of bows of "<s> context", "context word1", "context
> word2" (2grams) are set to zero. Why?
>
> They are the prefix of "<s> context word1" (or "<s> context word2"),
> "context word1 </s>", "context word2 </s>" respetively, so I think
> they are qualified to have bow values.
>
> I read the explanation of WBDiscount and "Warning5" in the
> ngram-discount manual (*1), but I couln't get it's answer.
>
> Any advices will help me very much. Thank you.
Backoff log weight zero (= 1 in the probability domain) means that the
bigram probs don't need to be modified when used for backoff purposes.
This is because, in your example, the probability mass left over from
the explicit trigrams is the same as the probability mass of the
corresponding bigrams. And this, in turn, is because your trigrams
-0.60206 <s> context word1
-0.60206 <s> context word2
have the same probabilities as the corresponding bigrams:
-0.60206 context word1 0
-0.60206 context word2 0
So there is nothing mysterious going on, it just happens to follow from
the bigram and trigrams in your data. You will not likely find this
situation in realistic data sets.
Andreas
>
> (*1) ngram-discount manual
> http://www-speech.sri.com/projects/srilm/manpages/ngram-discount.7.html
>
>
> ----------------------------------------------------------------------
> $ cat > smp.txt << EOF
> context word1
> context word2
> EOF
> $ ngram-count -order 3 -wbdiscount -text smp.txt -gtmin 0 -gt1min0 -gt2min 0
> -gt3min 0 -lm lm.arpa
> $ cat lm.arpa
>
> \data\
> ngram 1=5
> ngram 2=5
> ngram 3=4
>
> \1-grams:
> -0.5228788 </s>
> -99 <s> -0.3222193
> -0.5228788 context -0.07918124
> -0.69897 word1 -0.146128
> -0.69897 word2 -0.146128
>
> \2-grams:
> -0.1760913 <s> context 0
> -0.60206 context word1 0
> -0.60206 context word2 0
> -0.30103 word1 </s>
> -0.30103 word2 </s>
>
> \3-grams:
> -0.60206 <s> context word1
> -0.60206 <s> context word2
> -0.30103 context word1 </s>
> -0.30103 context word2 </s>
>
> \end\
>
> --
> Shinichiro Hamada
>
> _______________________________________________
> SRILM-User site list
> SRILM-User at speech.sri.com
> http://www.speech.sri.com/mailman/listinfo/srilm-user
More information about the SRILM-User
mailing list