[SRILM User List] ignore xml markup or other annotations

Andreas Stolcke stolcke at icsi.berkeley.edu
Thu Jun 12 10:38:31 PDT 2014


On 6/12/2014 7:53 AM, kamel nebhi wrote:
> Dear all,
>
> i'm actually using the disambig tool and i want to know if it's 
> possible to ignore some xml markup or annotations during the process. 
> For example, if i have this sentence:
> /I actually live in <PERS> Paris </PERS>/.
> i want to ignore the /PERS/ tags but i need to keep it for my evaluation.
>
SRILM does not do text processing because it is too 
application-dependent.  Instead most tools support readining/writing 
to/from stdin/stdout, so you can assemble a pipeline that combines text 
processing and SRILM tools.

Andreas

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20140612/dc8f697c/attachment.html>


More information about the SRILM-User mailing list