[SRILM User List] problems with installation on macosx

Andreas Stolcke stolcke at icsi.berkeley.edu
Thu May 2 13:08:32 PDT 2013


On 5/1/2013 5:04 PM, Maria Eskevich wrote:
> Dear Andreas,
>
> You were right, I haven't had the gawk installed. Thanks for the help.
>
> Could I please ask another question. If I have a file with lattice in 
> HTK format, is it possible to get the 1-best list with corresponding 
> timing and probability information?
> As I understood the option -acoustic-mesh should keep this 
> information, but I don't see any writing option that would combine the 
> 2 things together. Basically I need the 1 best list with 
> time/confidence scores information for each word.
>
> Maybe some additional changes with the following command line can help?
> ./lattice-tool -in-lattice file.lat -read-htk  -viterbi-decode 
> -acoustic-mesh

There is currently no option to dump all the acoustic information 
(scores, alignments) out in nbest format, although the information is 
available internally.

But for getting the 1-best version of that information there is a 
workaround.   You can generate confusion networks with a low posterior 
scaling factor.  That will force the 1-best in the CN to be the same as 
the 1-best in a  Viterbi decoding.   But using the the -acoustic-mesh 
option you can then read off the time alignments and score information.  Try

     lattice-tool -read-htk -in-lattice LATTICEFILE  -acoustic-mesh 
-write-mesh CNFILE -posterior-scale 0.01

and postprocess CNFILE.  It will contain stuff like

align 3 he 1 we 0 ate 0 me 0 h 0 u 0 t 0 you're 0 if_you_have 0 say 0 
pete 0 you 0 deep 0 aid 0 you'd 0 is 0 i 0 they 0 c 0 a 0 keep 0 q 0 t. 
0 a. 0 these 0 p. 0 oh 0 uh 0 lee 0 hee 0 she 0 really 0 indeed 0 hehe 0 
he'd 0 are_you 0 heat 0 eight 0 or 0 but_you 0 to_be 0 uhhuh 0 a._i. 0 
[laugh] 0 p 0 it's 0 see 0 but 0 e 0 hate 0 but_he 0 re 0 i_mean 0 neat 
0 i_see 0 and_he 0 ee 0 uh_you 0 need 0 yeah_you 0 maybe 0 and_it 0 v 0 
okay 0 v. 0 eee 0 do_you 0 e. 0 hes 0 g 0 mm 0 he's 0 easy 0 may 0 any 0 
pay 0 if 0 b. 0 they'd 0 you_you 0 hey 0 beep 0 it 0 c. 0 gee 0 if_you 0 
be 0 three 0 if_he 0 b 0 is_it 0 eat 0 d 0 d. 0 eighty 0
info 3 he 0.04 0.26 -165.916 -2.62371 :#[hh]iy,0.08:hh[iy]#,0.18: :
info 3 we 0.07 0.23 -160.802 -2.66818 :#[w]iy,0.03:w[iy]#,0.20: :
info 3 ate 0.03 0.27 -173.698 -2.75711 :#[ey]t,0.15:ey[t]#,0.12: :
....

Because of the low posterior scaling, all the posterior probability is 
on the 1-best word ("he" in this case).  Then you find the "info" record 
associated with that word an it will give you the start time, duration, 
acoustic and LM scores, and pronunciation and phone durations (the 
format is defined in the wlat-format(5) man page).

You can safely add a pruning option if the CN construction takes too 
long, since you are only interested in the 1-best output.

Andreas





>
> Best,
> Maria
>
> On 30 Apr 2013, at 23:25, Andreas Stolcke <stolcke at icsi.berkeley.edu 
> <mailto:stolcke at icsi.berkeley.edu>> wrote:
>
>> On 4/30/2013 5:02 AM, Maria Eskevich wrote:
>>>
>>> Dear Andreas,
>>>
>>> I downloaded the 1.7 version of SRILM and followed the instruction 
>>> for installation (checked with INSTALL file details and 
>>> http://www1.icsi.berkeley.edu/~wooters/SRILM/3%20Install(07F18266).html 
>>> <http://www1.icsi.berkeley.edu/%7Ewooters/SRILM/3%20Install%2807F18266%29.html>).
>>>
>>> My system is macosx 10.8.3, processor 2.9 Ghz Intel Core i7.
>> The compiler warnings are not a problem.   Verify that the binaries 
>> in $SRILM/bin/macosx are runnable, e.g., ngram -version.
>> If that's not the case then there is some problem with your compiler 
>> or linker and you should shared your complete log output -- hopefully 
>> some macosx expert can help.
>>
>> The tests could be failing because you don't have gawk installed.
>>
>> Andreas
>>
>>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20130502/8e831624/attachment.html>


More information about the SRILM-User mailing list