The Solr Terms Component will give you the terms in the index and the document frequency of each.

https://cwiki.apache.org/confluence/display/solr/The+Terms+Component
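
As a rough illustration of the above, here is a minimal Python sketch of how a client might query the Terms Component and read back the document frequencies. The base URL, core name (collection1), and field name (text) are assumptions for the example, as is a /terms request handler being configured in solrconfig.xml. The sample response is made up and only mirrors the default flat JSON layout of alternating term and count.

```python
import json
from urllib.parse import urlencode


def build_terms_url(base_url, field, limit=10, prefix=None):
    """Build a Terms Component request URL (assumes a /terms handler exists)."""
    params = {
        "terms": "true",       # enable the Terms Component
        "terms.fl": field,     # field whose indexed terms are listed
        "terms.limit": limit,  # maximum number of terms to return
        "wt": "json",          # ask for a JSON response
    }
    if prefix:
        params["terms.prefix"] = prefix  # restrict to terms with this prefix
    return f"{base_url}/terms?{urlencode(params)}"


def parse_terms(response_text, field):
    """Turn the flat [term, df, term, df, ...] list into a {term: df} dict."""
    flat = json.loads(response_text)["terms"][field]
    return dict(zip(flat[0::2], flat[1::2]))


# Hypothetical core URL and a made-up response body, for illustration only.
url = build_terms_url("http://localhost:8983/solr/collection1", "text")
sample = '{"terms": {"text": ["solr", 42, "lucene", 17]}}'
doc_freqs = parse_terms(sample, "text")
```

Note that this only yields index-wide document frequencies. The term frequencies for a text that is not stored in the index would still have to be computed on the client side, for example by counting the tokens returned by the analysis request handler.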

-- Jack Krupansky

-----Original Message-----
From: Domma, Achim
Sent: Monday, August 19, 2013 3:09 AM
To: solr-user@lucene.apache.org
Subject: Create term vector from text

Hi,

the TermVectorComponent allows me to retrieve data about the terms of a
document, including tf-idf. Is it possible to get this data for a text, but
without storing it in Solr? As far as I can tell, the AnalysisComponent
comes close, but does not return the core-specific frequencies. Obviously
the MLT handler has to do something like that internally. I tried to read
the code and found that there's a Lucene function to create a query
from a text.

Is there some public interface which allows me to access that kind of
functionality? Or do I have to write my own SearchComponent?

cheers,
Achim
