If you want to understand this kind of thing, the Lucene docs are the place to start.
TF - # of occurrences of a term in a single doc
DF - # of docs in the corpus/index that contain a term (IDF is the inverse DF)
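
To make the definitions concrete, here is a rough sketch in Python. The toy
corpus, the whitespace tokenizer and all function names are made up for
illustration. DF is counted here as the number of docs containing the term. The
weighting formulas are the ones I understand Lucene's default Similarity to use
(square root of the raw frequency for tf, 1 + ln(numDocs / (docFreq + 1)) for
idf, and 1 / sqrt(number of terms in the field) for the length part of
fieldNorm), so check them against the javadoc for your version before relying
on them.

```python
import math
from collections import Counter

# Toy corpus: the documents and the whitespace tokenizer are
# illustrative assumptions, not how Solr analyzes text.
docs = [
    "chapter 7 bankruptcy chapter 7",
    "chapter 13 bankruptcy",
    "filing taxes",
]
tokenized = [d.split() for d in docs]


def term_freq(term, tokens):
    """TF: raw count of a term within a single document."""
    return Counter(tokens)[term]


def doc_freq(term, corpus):
    """DF: number of documents in the corpus that contain the term."""
    return sum(1 for tokens in corpus if term in tokens)


def tf_weight(freq):
    """Assumed default tf weighting: square root of the raw frequency."""
    return math.sqrt(freq)


def idf_weight(df, num_docs):
    """Assumed default idf weighting: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (df + 1))


def length_norm(num_terms):
    """Assumed length normalization: shorter fields score higher."""
    return 1.0 / math.sqrt(num_terms)


if __name__ == "__main__":
    for term in ("chapter", "taxes"):
        df = doc_freq(term, tokenized)
        print(term, "df =", df, "idf =", idf_weight(df, len(tokenized)))
    for tokens in tokenized:
        freq = term_freq("chapter", tokens)
        print(tokens, "tf =", freq, "lengthNorm =", length_norm(len(tokens)))
```

Note that tf itself does not look at how long the doc is. Document length
enters the score separately, through the length normalization that is folded
into fieldNorm at index time.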


But look, a quick Google search turns up the Similarity javadoc, which covers the rest:

http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/search/Similarity.html

Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

----- Original Message ----
From: anuvenk <[EMAIL PROTECTED]>
To: solr-user@lucene.apache.org
Sent: Saturday, January 5, 2008 4:25:08 PM
Subject: what are tf,idf,fieldNorm,queryNorm.?


I understand tf means term frequency. For example, if the search term is
'chapter 7', does tf mean how frequently 'chapter 7' occurs in the docs?
Does it take into account the total number of words in a doc to determine
frequency? Also, what are idf, fieldNorm and queryNorm? I am trying to
understand how Solr calculates the score.
-- 
View this message in context:
http://www.nabble.com/what-are-tf%2Cidf%2CfieldNorm%2CqueryNorm.--tp14639048p14639048.html
Sent from the Solr - User mailing list archive at Nabble.com.



