[SRILM User List] ignore xml markup or other annotations
Andreas Stolcke
stolcke at icsi.berkeley.edu
Thu Jun 12 10:38:31 PDT 2014
On 6/12/2014 7:53 AM, kamel nebhi wrote:
> Dear all,
>
> i'm actually using the disambig tool and i want to know if it's
> possible to ignore some xml markup or annotations during the process.
> For example, if i have this sentence:
> /I actually live in <PERS> Paris </PERS>/.
> i want to ignore the /PERS/ tags but i need to keep it for my evaluation.
>
SRILM does not do text processing because it is too
application-dependent. Instead most tools support readining/writing
to/from stdin/stdout, so you can assemble a pipeline that combines text
processing and SRILM tools.
Andreas
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20140612/dc8f697c/attachment.html>
More information about the SRILM-User
mailing list