<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html;
      charset=windows-1252">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    <div class="moz-cite-prefix">You are correct, -renorm normalizes the
      model assuming the probabilities for each history sum up to <=
      1.</div>
    <div class="moz-cite-prefix">There is no option to rescale the ngram
      probabilities themselves.</div>
    <div class="moz-cite-prefix"><br>
    </div>
    <div class="moz-cite-prefix">However, you are already doing your own
      processing to transfer the NN outputs to the ngram model format. 
      It would be trivial to add a normalization step that sums them up
      (for each history), and rescales them if the sum is > 1.</div>
    <div class="moz-cite-prefix"><br>
    </div>
    <div class="moz-cite-prefix">The more serious question is, how much
      probability mass should you allocate to unseen ngrams?  If the NN
      estimates probabilities that sum to 1 you have a normalized model,
      but not a very good one because it doesn't anticipate ever seeing
      a word that you haven't already seen in that context.  So you
      should find a way to estimate the "unseen word" probability in
      your framework, and then include that in your normalization step.</div>
    <div class="moz-cite-prefix"><br>
    </div>
    <div class="moz-cite-prefix">Andreas<br>
    </div>
    <div class="moz-cite-prefix"><br>
    </div>
    <div class="moz-cite-prefix">On 8/24/2019 2:31 PM, Van der Merwe, W,
      Mnr [<a class="moz-txt-link-abbreviated" href="mailto:20076223@sun.ac.za">20076223@sun.ac.za</a>] wrote:<br>
    </div>
    <blockquote type="cite"
cite="mid:VI1PR07MB58542DF06EA09900F9E442DE8BA70@VI1PR07MB5854.eurprd07.prod.outlook.com">
      <meta http-equiv="Content-Type" content="text/html;
        charset=windows-1252">
      <style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
      <div style="font-family: Calibri, Arial, Helvetica, sans-serif;
        font-size: 12pt; color: rgb(0, 0, 0);">
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          Hi,</div>
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          <br>
        </div>
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          I am a student at Stellenbosch University currently using the
          SRILM toolkit for one of my projects. I would like to know if
          the toolkit is able to renormalize the probabilities, given an
          ARPA file, so that they sum to 1. I've read the documentation
          and am aware of the -renorm parameter option, however, I am
          not seeking to renormalize backoff weights, only the
          probabilities.</div>
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          <br>
        </div>
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          The reason I ask this is that I am writing an ARPA file
          myself, taking probabilities produced by a neural network.
          Because these probabilities are estimated by a neural net,
          they tend not to sum not 1 perfectly. I am hoping that SRILM
          can correct this. Otherwise, I will have to write a script to
          brute force it.</div>
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          <br>
        </div>
        <div style="margin: 0px; font-size: 12pt; font-family: Calibri,
          Arial, Helvetica, sans-serif; background-color: rgb(255, 255,
          255)">
          Werner</div>
        <br>
      </div>
      <div><a
href="https://gcc02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.sun.ac.za%2Fenglish%2Fabout-us%2Fstrategic-documents&data=01%7C01%7Csrilm-user%40speech.sri.com%7Cd443b6b9943f498dfd5908d728ef58cd%7C40779d3379c44626b8bf140c4d5e9075%7C1&sdata=y1wlD1TMitrr5%2Bbb6ln9l0CKkKRkh8vLuZU9RcP8AGI%3D&reserved=0" originalSrc="http://www.sun.ac.za/english/about-us/strategic-documents" shash="jpTzyTlZ7IzywKZ0lzF9+um3tio+1jhm4DQQR9oOUZkHozpIYYCXucVeTl6kwxoUDV3p0YcdSf5Fbv7LhqBRRfSHzbLZ/K9muhSS1fwU6GHrSNAmk8afqCihzsSPuGp8tPnoyW5tSn0BWok8q50q7kCofb/Sg8MV0eQlogp9Lus="
originalsrc="http://www.sun.ac.za/english/about-us/strategic-documents"
shash="Pa+DT3ctCyafxhOqhglMWbaJh3HdLy1M0KEdPoU9DrVUGNG1swxlOUXzsMZjN+rbOrSZrHn4WJM+k90pYyQr3PVVJo0CDbjgtAqNSl5bBQJzJxot8NB1vnO167oUHOfvAx3ykRSZECgk3qOPRaK+8EPMv5tU2tVIaWBZXYmlo0c="
          moz-do-not-send="true"><img
            src="http://cdn.sun.ac.za/100/ProductionFooter.jpg"
            moz-do-not-send="true"></a></div>
      <br>
      <span style="font-size: 11px; font-family: 'Verdana';
        color:#9b9f9e;">The integrity and confidentiality of this email
        are governed by these terms.
        <a
href="https://gcc02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.sun.ac.za%2Femaildisclaimer&data=01%7C01%7Csrilm-user%40speech.sri.com%7Cd443b6b9943f498dfd5908d728ef58cd%7C40779d3379c44626b8bf140c4d5e9075%7C1&sdata=tFFDwIA9FROFkatqxx90CkkUIvu45QbFHurS2IDZFNQ%3D&reserved=0
          " moz-do-not-send="true">Disclaimer</a><br>
        Die integriteit en vertroulikheid van hierdie e-pos word deur
        die volgende bepalings bereël.
        <a
href="https://gcc02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.sun.ac.za%2Femaildisclaimer&data=01%7C01%7Csrilm-user%40speech.sri.com%7Cd443b6b9943f498dfd5908d728ef58cd%7C40779d3379c44626b8bf140c4d5e9075%7C1&sdata=tFFDwIA9FROFkatqxx90CkkUIvu45QbFHurS2IDZFNQ%3D&reserved=0
          " moz-do-not-send="true">Vrywaringsklousule</a></span>
      <br>
      <fieldset class="mimeAttachmentHeader"></fieldset>
      <pre class="moz-quote-pre" wrap="">_______________________________________________
SRILM-User site list
<a class="moz-txt-link-abbreviated" href="mailto:SRILM-User@speech.sri.com">SRILM-User@speech.sri.com</a>
<a class="moz-txt-link-freetext" href="http://mailman.speech.sri.com/cgi-bin/mailman/listinfo/srilm-user">http://mailman.speech.sri.com/cgi-bin/mailman/listinfo/srilm-user</a></pre>
    </blockquote>
    <p><br>
    </p>
  </body>
</html>