> I've been looking at the Solr TermVectorComponent
> (http://wiki.apache.org/solr/TermVectorComponent) and it seems to have
> something similar to this, but it looks to me like this is a component
> that is processed at query time (?) and is limited to 1-gram terms.
If you add <filter class="solr.ShingleFilterFactory" maxShingleSize="2"
outputUnigrams="false"/> to the field's analyzer, the component can give you
info about 2-gram terms as well.
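
To make the effect of that filter concrete, here is a rough plain-Python
illustration of what 2-gram shingling without unigrams does to a token
stream. This is not Solr code: the function name `shingles` and the sample
tokens are made up for the example, and it assumes the tokens are joined
with a single space.

```python
def shingles(tokens, size=2):
    """Return every run of `size` adjacent tokens, joined by a space.

    Mimics the idea behind maxShingleSize="2" with outputUnigrams="false":
    only the 2-grams are emitted, the single tokens are dropped.
    """
    return [" ".join(tokens[i:i + size]) for i in range(len(tokens) - size + 1)]


# Hypothetical token stream, as it might look after tokenizing and lowercasing.
tokens = ["please", "divide", "this", "sentence"]
print(shingles(tokens))
```

Each of those 2-grams then becomes a term in its own right, so it gets its
own tf and df in the term vector output.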
> Also, the tf/idf scores are a little different as they come back in
> integer values as separate components.
The example output on the wiki displays only the tf and df values, which are
integers. You can get tf*idf (a double) with these parameters:
&qt=tvrh&tv=true&fl=yourFieldName&tv.tf=true&tv.df=true&tv.tf_idf=true
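
If it helps, here is a small sketch of how that parameter string could be
assembled into a full request URL from Python. The host, port, and path in
`base_url` and the `q=*:*` query are assumptions on my side (adjust them to
your setup); `yourFieldName` is the same placeholder as above. Only the
tv.* parameters, qt, and fl come from the parameter string itself.

```python
from urllib.parse import urlencode

# Assumed location of the Solr select handler; change to match your install.
base_url = "http://localhost:8983/solr/select"

params = {
    "q": "*:*",              # assumption: match all docs, any query works here
    "qt": "tvrh",            # request handler with the TermVectorComponent
    "tv": "true",            # turn the component on
    "fl": "yourFieldName",   # placeholder: the field that stores term vectors
    "tv.tf": "true",         # term frequency (integer)
    "tv.df": "true",         # document frequency (integer)
    "tv.tf_idf": "true",     # tf*idf (double)
}

url = base_url + "?" + urlencode(params)
print(url)
```

urlencode takes care of escaping the query value, so you can swap in a real
query without worrying about special characters.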