Re: Word frequency count in the index

Otis Gospodnetic Thu, 16 Jul 2009 08:27:42 -0700

Plus there is a single class that you can run from the command line in Lucene's 
contrib.  I think it's called HighFreqTerms or something close to that.


 Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----
> From: Grant Ingersoll <gsing...@apache.org>
> To: solr-user@lucene.apache.org
> Sent: Thursday, July 16, 2009 6:35:28 AM
> Subject: Re: Word frequency count in the index
> 
> In the trunk version, the TermsComponent should give you this:  
> http://wiki.apache.org/solr/TermsComponent.  Also, you can use the 
> LukeRequestHandler to get the top words in each field.
> 
> Alternatively, you may just want to point Luke at your index.
> 
> On Jul 16, 2009, at 6:29 AM, Pooja Verlani wrote:
> 
> > Hi,
> > 
> > Is there any way in SOLR to know the count of each word indexed in the solr
> > ?
> > I want to find out the different word frequencies to figure out '
> > application specific stop words'.
> > 
> > Please let me know if its possible.
> > 
> > Thank you,
> > Regards,
> > Pooja
> 
> --------------------------
> Grant Ingersoll
> http://www.lucidimagination.com/
> 
> Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids) using 
> Solr/Lucene:
> http://www.lucidimagination.com/search

Re: Word frequency count in the index

Reply via email to