[SRILM User List] problems with installation on macosx
Andreas Stolcke
stolcke at icsi.berkeley.edu
Thu May 2 13:08:32 PDT 2013
On 5/1/2013 5:04 PM, Maria Eskevich wrote:
> Dear Andreas,
>
> You were right, I haven't had the gawk installed. Thanks for the help.
>
> Could I please ask another question. If I have a file with lattice in
> HTK format, is it possible to get the 1-best list with corresponding
> timing and probability information?
> As I understood the option -acoustic-mesh should keep this
> information, but I don't see any writing option that would combine the
> 2 things together. Basically I need the 1 best list with
> time/confidence scores information for each word.
>
> Maybe some additional changes with the following command line can help?
> ./lattice-tool -in-lattice file.lat -read-htk -viterbi-decode
> -acoustic-mesh
There is currently no option to dump all the acoustic information
(scores, alignments) out in nbest format, although the information is
available internally.
But for getting the 1-best version of that information there is a
workaround. You can generate confusion networks with a low posterior
scaling factor. That will force the 1-best in the CN to be the same as
the 1-best in a Viterbi decoding. But using the the -acoustic-mesh
option you can then read off the time alignments and score information. Try
lattice-tool -read-htk -in-lattice LATTICEFILE -acoustic-mesh
-write-mesh CNFILE -posterior-scale 0.01
and postprocess CNFILE. It will contain stuff like
align 3 he 1 we 0 ate 0 me 0 h 0 u 0 t 0 you're 0 if_you_have 0 say 0
pete 0 you 0 deep 0 aid 0 you'd 0 is 0 i 0 they 0 c 0 a 0 keep 0 q 0 t.
0 a. 0 these 0 p. 0 oh 0 uh 0 lee 0 hee 0 she 0 really 0 indeed 0 hehe 0
he'd 0 are_you 0 heat 0 eight 0 or 0 but_you 0 to_be 0 uhhuh 0 a._i. 0
[laugh] 0 p 0 it's 0 see 0 but 0 e 0 hate 0 but_he 0 re 0 i_mean 0 neat
0 i_see 0 and_he 0 ee 0 uh_you 0 need 0 yeah_you 0 maybe 0 and_it 0 v 0
okay 0 v. 0 eee 0 do_you 0 e. 0 hes 0 g 0 mm 0 he's 0 easy 0 may 0 any 0
pay 0 if 0 b. 0 they'd 0 you_you 0 hey 0 beep 0 it 0 c. 0 gee 0 if_you 0
be 0 three 0 if_he 0 b 0 is_it 0 eat 0 d 0 d. 0 eighty 0
info 3 he 0.04 0.26 -165.916 -2.62371 :#[hh]iy,0.08:hh[iy]#,0.18: :
info 3 we 0.07 0.23 -160.802 -2.66818 :#[w]iy,0.03:w[iy]#,0.20: :
info 3 ate 0.03 0.27 -173.698 -2.75711 :#[ey]t,0.15:ey[t]#,0.12: :
....
Because of the low posterior scaling, all the posterior probability is
on the 1-best word ("he" in this case). Then you find the "info" record
associated with that word an it will give you the start time, duration,
acoustic and LM scores, and pronunciation and phone durations (the
format is defined in the wlat-format(5) man page).
You can safely add a pruning option if the CN construction takes too
long, since you are only interested in the 1-best output.
Andreas
>
> Best,
> Maria
>
> On 30 Apr 2013, at 23:25, Andreas Stolcke <stolcke at icsi.berkeley.edu
> <mailto:stolcke at icsi.berkeley.edu>> wrote:
>
>> On 4/30/2013 5:02 AM, Maria Eskevich wrote:
>>>
>>> Dear Andreas,
>>>
>>> I downloaded the 1.7 version of SRILM and followed the instruction
>>> for installation (checked with INSTALL file details and
>>> http://www1.icsi.berkeley.edu/~wooters/SRILM/3%20Install(07F18266).html
>>> <http://www1.icsi.berkeley.edu/%7Ewooters/SRILM/3%20Install%2807F18266%29.html>).
>>>
>>> My system is macosx 10.8.3, processor 2.9 Ghz Intel Core i7.
>> The compiler warnings are not a problem. Verify that the binaries
>> in $SRILM/bin/macosx are runnable, e.g., ngram -version.
>> If that's not the case then there is some problem with your compiler
>> or linker and you should shared your complete log output -- hopefully
>> some macosx expert can help.
>>
>> The tests could be failing because you don't have gawk installed.
>>
>> Andreas
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20130502/8e831624/attachment.html>
More information about the SRILM-User
mailing list