Lucene uses a tf.idf relevance formula, so it automatically finds common words (stop words) in your documents and gives them lower weight. I recommend not removing stop words at all and letting Lucene handle the weighting.
wunder On 7/16/09 3:29 AM, "Pooja Verlani" <pooja.verl...@gmail.com> wrote: > Hi, > > Is there any way in SOLR to know the count of each word indexed in the solr > ? > I want to find out the different word frequencies to figure out ' > application specific stop words'. > > Please let me know if its possible. > > Thank you, > Regards, > Pooja