Hi-
I'm working on a case where we have review text that may include words
that describe what the item is *not*.
Given the text "the kitten is not clean", searching for "clean" should
not include (at least at the top) the kitten.
The approach I am considering is to copy the text to a negation field
and do simple heuristic analysis in a TokenFilter. This analysis
would only keep tokens for words that follow "not", then we could add
a negative boost for this field:
title^2 content^1 negation^0.1
Does this seem like a reasonable approach? Any other ideas /
suggestions / pointers?
thanks
ryan