Sean Timm wrote:
To solve this, we wrote our own Similarity class which extends DefaultSimilarity and maps numTerms 1-10 to precalculated values between 1.5f and 0.3125f. For numTerms >10, we use the standard formula above. If anyone else is interested in this, I can post the code as a patch in Jira.Does this actually have a good measurable affect for you? Wouldn't it make more sense to just turn off norms for short fields?