On 1-Oct-07, at 7:06 AM, [EMAIL PROTECTED] wrote:
> Hi!
> I have a question about the correlation between the score value and
> the term frequency. Let's assume that we have one index over one set
> of documents, and that there is only one term in a query.
> Suppose we search for the term "car" and get a certain score value X,
> and then search for the term "football" and get the same score
> value X.
> Can we be sure that the term frequency is the same in both cases?
> Could you explain what correlation exists between the score value and
> the term frequency in my scenario?
If the field has norms, there is a correlation, but the tf cannot be
recovered from the score because of field length normalization.
Query normalization also makes it difficult to compare scores from
one query to another.
To start, see http://lucene.apache.org/java/docs/scoring.html, in
particular the link to the Similarity class javadocs.
-Mike
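A rough sketch of why equal scores do not imply equal term frequency.
It does not call Lucene; it models a single-term score as
tf * idf * lengthNorm, using the formulas I believe the default
Similarity uses (sqrt of the frequency, 1 + ln(numDocs / (docFreq + 1)),
and 1 / sqrt(number of terms in the field)). Boosts, query
normalization, and the lossy encoding of stored norms are left out, and
all function names and numbers below are made up for illustration.

```python
import math

# Simplified model of a single-term score; NOT Lucene's actual API.
# The formulas are assumed from the default Similarity's documentation.

def tf(freq):
    # Term-frequency factor: grows with the square root of the raw count.
    return math.sqrt(freq)

def idf(doc_freq, num_docs):
    # Inverse document frequency: rarer terms weigh more.
    return 1.0 + math.log(num_docs / (doc_freq + 1))

def length_norm(num_terms):
    # Field length normalization: longer fields are penalized.
    return 1.0 / math.sqrt(num_terms)

def single_term_score(freq, field_length, doc_freq, num_docs):
    return tf(freq) * idf(doc_freq, num_docs) * length_norm(field_length)

# Hypothetical numbers: "car" occurs 4 times in a 16-term field,
# "football" occurs once in a 4-term field; both terms are equally rare.
score_car = single_term_score(freq=4, field_length=16, doc_freq=9, num_docs=1000)
score_football = single_term_score(freq=1, field_length=4, doc_freq=9, num_docs=1000)

print(score_car, score_football)
```

Under these assumptions both searches produce the same score even
though the raw term frequencies are 4 and 1, because the shorter field
gets a larger length norm. A difference in docFreq between the two
terms could mask a difference in tf in the same way.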