<div dir="ltr"><div>Hi,</div><div><br></div>I don't know if this has been asked before, but does it make sense to interpolate language models that differ in smoothing method rather than in domain/genre? What assumptions are needed to justify this when the resulting perplexity is lower than that of either model on its own?<div><br></div><div>Let's say a 5-gram Katz model yields a perplexity of 100, and a 5-gram modified Kneser-Ney (KN) model yields 90.</div><div>Then the best mix of the two yields 87.</div><div><br></div><div class="cye-lm-tag">From a theoretical perspective, is it sound to simply trust that the interpolated LM is better, and that the gain generalizes to other combinations of smoothing methods?</div><div><br></div><div>-Fred</div></div>