David, I can imagine that "DF for resultset" is facets!
On Fri, Feb 7, 2014 at 11:26 PM, David Miller <davthehac...@gmail.com>wrote: > Hi Mikhail, > > The DF seems to be based on the entire document set. What I require is > based on a the results of a single query. > > Suppose my Solr query returns a set of 50K documents from a superset of > 10Million documents, I require to calculate the DF just based on the 50K > documents. But currently it seems to be calculated on the entire doc set. > > So, is there any way to get the DF or IDF just on basis of the docs > returned by the query? > > Regards, > Dave > > > > > > > > On Fri, Feb 7, 2014 at 5:15 AM, Mikhail Khludnev < > mkhlud...@griddynamics.com > > wrote: > > > Hello Dave > > you can get DF from http://wiki.apache.org/solr/TermsComponent (invert > it > > yourself) > > then, for certain term you can get number of occurrences per document by > > http://wiki.apache.org/solr/FunctionQuery#tf > > > > > > > > On Fri, Feb 7, 2014 at 3:58 AM, David Miller <davthehac...@gmail.com> > > wrote: > > > > > Hi Guys.. > > > > > > I require to obtain Tf-idf score from Solr for a certain set of > > documents. > > > But the catch is that, I needs the IDF (or DF) to be calculated on the > > > documents returned by the specific query and not the entire corpus. > > > > > > Please provide me some hint on whether Solr has this feature or if I > can > > > use the Lucene Api directly to achieve this. > > > > > > > > > Thanks in advance, > > > Dave > > > > > > > > > > > -- > > Sincerely yours > > Mikhail Khludnev > > Principal Engineer, > > Grid Dynamics > > > > <http://www.griddynamics.com> > > <mkhlud...@griddynamics.com> > > > -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics <http://www.griddynamics.com> <mkhlud...@griddynamics.com>