Hi Mikhail,

The DF seems to be based on the entire document set. What I require is
based on a the results of a single query.

Suppose my Solr query returns a set of 50K documents from a superset of
10Million documents, I require to calculate the DF just based on the 50K
documents. But currently it seems to be calculated on the entire doc set.

So, is there any way to get the DF or IDF just on basis of the docs
returned by the query?

Regards,
Dave







On Fri, Feb 7, 2014 at 5:15 AM, Mikhail Khludnev <mkhlud...@griddynamics.com
> wrote:

> Hello Dave
> you can get DF from http://wiki.apache.org/solr/TermsComponent (invert it
> yourself)
> then, for certain term you can get number of occurrences per document by
> http://wiki.apache.org/solr/FunctionQuery#tf
>
>
>
> On Fri, Feb 7, 2014 at 3:58 AM, David Miller <davthehac...@gmail.com>
> wrote:
>
> > Hi Guys..
> >
> > I require to obtain Tf-idf score from Solr for a certain set of
> documents.
> > But the catch is that, I needs the IDF (or DF) to be calculated on the
> > documents returned by the specific query and not the entire corpus.
> >
> > Please provide me some hint on whether Solr has this feature or if I can
> > use the Lucene Api directly to achieve this.
> >
> >
> > Thanks in advance,
> > Dave
> >
>
>
>
> --
> Sincerely yours
> Mikhail Khludnev
> Principal Engineer,
> Grid Dynamics
>
> <http://www.griddynamics.com>
>  <mkhlud...@griddynamics.com>
>

Reply via email to