> What is the best way to essentially get a term frequency vector for
> multi-word terms?
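Use solr.ShingleFilterFactory together with the TermVectorComponent. The shingle filter emits multi-word tokens (shingles) at analysis time, and the TermVectorComponent returns the term frequencies for a field indexed with termVectors="true".

http://wiki.apache.org/solr/TermVectorComponent
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ShingleFilterFactory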
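
To make the idea concrete, here is a rough Python sketch of what shingling followed by term counting produces. It is only an illustration, not Solr code: the function names (shingles, term_frequencies) are hypothetical, the parameter names are loosely modeled on the shingle filter's options, and lowercasing plus whitespace splitting stands in for whatever tokenizer your field type really uses.

```python
from collections import Counter


def shingles(tokens, min_size=2, max_size=2, output_unigrams=True, sep=" "):
    """Return word n-grams (shingles) built from a list of tokens.

    Hypothetical helper: mimics the general behaviour of a shingle
    filter, not the exact token stream Solr would emit.
    """
    out = []
    for i in range(len(tokens)):
        if output_unigrams:
            out.append(tokens[i])  # keep the single-word term as well
        for n in range(min_size, max_size + 1):
            if i + n <= len(tokens):
                out.append(sep.join(tokens[i:i + n]))
    return out


def term_frequencies(text, **kwargs):
    """Count single-word and multi-word terms in one piece of text."""
    tokens = text.lower().split()  # assumption: plain whitespace tokenization
    return Counter(shingles(tokens, **kwargs))


tf = term_frequencies("new york is not new york city")
print(tf["new york"])   # 2
print(tf["york city"])  # 1
```

In Solr itself the counting happens at index time, so the equivalent numbers come back in the TermVectorComponent response rather than from client-side code like this.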