Isn't the TermVectorComponent more for one document at a time, and the
TermsComponent for the whole index?

Actually -- having done some digging... What I'm really after is the most
informative terms in a given document, which should take into account global
document frequency as well as term frequency in the document at hand. I
think I can use the MoreLikeThisHandler to do this, with a bit of
experimentation...
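
To make "most informative" a bit more concrete, here is a rough sketch of the kind of scoring I mean: plain tf-idf, done client-side in Python. I'm not claiming this is what the MoreLikeThisHandler does internally; top_terms, doc_freq and num_docs are names I've made up for illustration, and the document frequencies would have to be pulled from the index somehow.

```python
import math
from collections import Counter

def top_terms(doc_tokens, doc_freq, num_docs, k=10):
    """Rank the terms of one document by tf * idf (illustrative only)."""
    tf = Counter(doc_tokens)  # term frequency within this document
    scores = {}
    for term, count in tf.items():
        # Assumption: a term missing from doc_freq occurs in one document.
        df = doc_freq.get(term, 1)
        idf = math.log(num_docs / df)  # rarer across the index => larger
        scores[term] = count * idf
    # Highest-scoring terms first, truncated to the top k.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:k]

# Hypothetical numbers: "the" is frequent here but appears in every document.
tokens = ["solr", "solr", "the", "the", "the", "facet"]
doc_freq = {"solr": 10, "the": 1000, "facet": 5}
ranked = top_terms(tokens, doc_freq, num_docs=1000)
```

With those made-up numbers, "the" ends up last despite having the highest raw count, which is the behaviour I want from a keyword list.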

Thanks for the facet mincount tip BTW.

Andrew.


Avlesh Singh wrote:
> 
> For facets -
> http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount
> For terms - http://wiki.apache.org/solr/TermsComponent
> 
> Helps?
> 
> Cheers
> Avlesh
> 
> On Wed, Oct 28, 2009 at 11:32 PM, Andrew Clegg <andrew.cl...@gmail.com> wrote:
> 
>>
>> Hi,
>>
>> If I give a query that matches a single document, and facet on a particular
>> field, I get a list of all the terms in that field which appear in that
>> document.
>>
>> (I also get some with a count of zero, I don't really understand where they
>> come from... ?)
>>
>> Is it possible with faceting, or a similar mechanism, to get a count of how
>> many times each term appears within that document?
>>
>> This would be really useful for building a list of top keywords within a
>> long document, for summarization purposes. I can do it on the client side
>> but it'd be nice to know if there's a quicker way.
>>
>> Thanks!
>>
>> Andrew.
>>
>> --
>> View this message in context:
>> http://www.nabble.com/Faceting-within-one-document-tp26099278p26099278.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>>
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Faceting-within-one-document-tp26099278p26099847.html
Sent from the Solr - User mailing list archive at Nabble.com.
