nbest-rover-acoustic test failing

Andreas Stolcke stolcke at speech.sri.com
Fri Sep 14 20:29:14 PDT 2007


David Gelbart wrote:
> Hi,
>
> I have built SRILM 1.5.3 under Fedora Core 3 and Ubuntu 6.06. The 
> nbest-rover-acoustic test fails for me because stdout differs from the 
> reference output.
>
> Below, I have included the beginning of my output and the reference 
> output.  On line 10, puh_f and pum_f in the reference output are 
> replaced with puh and pum in my output.  On line 16, 0.958381 in the 
> reference output is replaced with 0.958202 in my output, and similarly 
> for several of the other numbers.  The same kind of differences 
> (sometimes missing _f after phone name, and sometimes slightly 
> different numbers) continue later on in my output, and there are also 
> cases where different words are recognized.  I have placed the full 
> outputs at http://www.icsi.berkeley.edu/~gelbart/sriTest.tar
>
> Does anyone have suggestions about what might be causing this? I have 
> set LANG=C, LC_NUMERIC=C, and LC_ALL=C.
It's a bug in the reference output.  There was an update to the handling 
of phone labels with diacritics ("_f")  in nbest-rover-acoustic, in 
release 1.5.3, but I never regenerated the reference output for this test.

Your output is in fact correct.  If you want you can download 1.5.4-beta 
and grab the reference output in it.

Andreas

>
> The beginning of my output, with line numbers:
>
> [root at localhost test]# head -16 
> output/nbest-rover-acoustic.i686.stdout | cat -n
>     1  name sw_40008_A_0015814_0016128
>     2  numaligns 16
>     3  posterior 1
>     4  align 0 *DELETE* 0.999999 uhhuh 1.28131e-06
>     5  reference 0 *DELETE*
>     6  info 0 uhhuh 0.55 0.54 -401.471 -14.9418 m:hh:pum 3:43:8
>     7  align 1 uhhuh 0.998465 um 0.00128149 uh 0.000207527 huh 
> 4.63062e-05 *DELETE* 0
>     8  reference 1 *DELETE*
>     9  info 1 uhhuh 0.55 0.54 -401.471 -14.9418 m:hh:pum 3:43:8
>    10  info 1 um 0.58 0.21 -154.354 -13.163 puh:pum 18:3
>    11  info 1 uh 0.58 0.43 -308.707 -17.7878 puh 43
>    12  info 1 huh 0.57 0.22 -170.807 -17.7878 hh:pum 12:10
>    13  align 2 *DELETE* 1 [laugh] 9.69422e-10
>    14  reference 2 *DELETE*
>    15  info 2 [laugh] 1.08 0.17 -160.491 -18.1436 lau:lau 14:3
>    16  align 3 *DELETE* 0.958202 [mouth] 0.0325037 uhhuh 0.00557467 
> [laugh] 0.00342208 [noise] 0.000275018 yeah 1.44687e-05 is 4.34731e-06 
> oh 3.83087e-06 huh 1.56753e-07 @reject@ 1.10134e-12 it 9.69716e-13
>
> The beginning of the reference output, with line numbers:
>
> [root at localhost test]# head -16 reference/nbest-rover-acoustic.stdout 
> | cat -n
>     1  name sw_40008_A_0015814_0016128
>     2  numaligns 16
>     3  posterior 1
>     4  align 0 *DELETE* 0.999999 uhhuh 1.28131e-06
>     5  reference 0 *DELETE*
>     6  info 0 uhhuh 0.55 0.54 -401.471 -14.9418 m:hh:pum 3:43:8
>     7  align 1 uhhuh 0.998465 um 0.00128149 uh 0.000207527 huh 
> 4.63062e-05 *DELETE* 0
>     8  reference 1 *DELETE*
>     9  info 1 uhhuh 0.55 0.54 -401.471 -14.9418 m:hh:pum 3:43:8
>    10  info 1 um 0.58 0.21 -154.354 -13.163 puh_f:pum_f 18:3
>    11  info 1 uh 0.58 0.43 -308.707 -17.7878 puh 43
>    12  info 1 huh 0.57 0.22 -170.807 -17.7878 hh:pum 12:10
>    13  align 2 *DELETE* 1 [laugh] 9.69422e-10
>    14  reference 2 *DELETE*
>    15  info 2 [laugh] 1.08 0.17 -160.491 -18.1436 lau:lau 14:3
>    16  align 3 *DELETE* 0.958381 [mouth] 0.0323282 uhhuh 0.00557468 
> [laugh] 0.00341845 [noise] 0.000274616 yeah 1.44687e-05 is 4.34731e-06 
> oh 3.83087e-06 huh 1.56753e-07 @reject@ 1.10134e-12 it 9.69716e-13
>
> Thanks,
> David





More information about the SRILM-User mailing list