You should really look at Lucene first, if you want to know this type of stuff. TF - # of occurrences of a term in a single doc DF - # of occurrences of a term in the corpus/index (IDF is the inverse DF)
But look....google... http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/search/Similarity.html Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch ----- Original Message ---- From: anuvenk <[EMAIL PROTECTED]> To: solr-user@lucene.apache.org Sent: Saturday, January 5, 2008 4:25:08 PM Subject: what are tf,idf,fieldNorm,queryNorm.? I understand tf means term frequency. For eg: if the search term is 'chapter 7', does tf mean how frequently 'chapter 7' occurs in the docs? Does it take in to account the total number of words in a doc to determine frequency. Also what is idf, fieldNorm and queryNorm. Trying to understand how solr calculates the solr score. -- View this message in context: http://www.nabble.com/what-are-tf%2Cidf%2CfieldNorm%2CqueryNorm.--tp14639048p14639048.html Sent from the Solr - User mailing list archive at Nabble.com.