I believe I found a solution: use a third-party stemmer to stem the term
first, then pass it to termfreq.

The only trick is that each term in a phrase has to be stemmed separately
(e.g. "end-user experience" has to be broken down into "end-user" -> "end-us"
and "experience" -> "experi") before being passed, i.e. termfreq(body,
"end-us experi").
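
A minimal sketch of that per-term stemming step in Python, in case it helps
anyone. The helper names (build_termfreq, example_stem) are hypothetical, and
the stemmer here is only a lookup table holding the two stems quoted above; it
stands in for whatever third-party Porter stemmer you actually use.

```python
from typing import Callable

# Stand-in for a real stemmer: only knows the two stems quoted in this thread.
# Assumption: in practice you would pass in a third-party Porter stemmer instead.
EXAMPLE_STEMS = {"end-user": "end-us", "experience": "experi"}


def example_stem(term: str) -> str:
    """Return the known stem for a term, or the lowercased term unchanged."""
    lowered = term.lower()
    return EXAMPLE_STEMS.get(lowered, lowered)


def build_termfreq(field: str, phrase: str, stem: Callable[[str], str]) -> str:
    """Stem each whitespace-separated term on its own, then build the call."""
    stemmed = " ".join(stem(term) for term in phrase.split())
    return f'termfreq({field}, "{stemmed}")'


print(build_termfreq("body", "end-user experience", example_stem))
```

The stemmer is passed in as a plain callable so it can be swapped for one that
matches the stemming filter configured on the indexed field; if the two
disagree, the stemmed term will not match what is in the index.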

From what I can tell, FunctionQuery / termfreq doesn't have a way to apply
stemming.

Akos (Aki) Balogh
Co-Founder, MarketMuse
https://www.MarketMuse.com <https://www.marketmuse.com/>


On Fri, Jul 24, 2015 at 12:04 PM, Aki Balogh <a...@marketmuse.com> wrote:

> Hi All,
>
> I'm using TermVectorComponent and stemming (Porter) in order to get term
> frequencies with fuzzy matching. I'm stemming at index and query time.
>
> Is there a way to get term frequency from the index?
> * termfreq doesn't support stemming or wildcards
> * terms component doesn't allow additional filters
> * I could use a copyfield to save a non-stemmed version at indexing, and
> run termfreq on that, but then I don't get any fuzzy matching
>
> Thanks,
> Aki
>
