Hi-

I'm working on a case where we have review text that may include words that describe what the item is *not*.

Given the text "the kitten is not clean", searching for "clean" should not include (at least at the top) the kitten.

The approach I am considering is to copy the text to a negation field and do simple heuristic analysis in a TokenFilter. This analysis would only keep tokens for words that follow "not", then we could add a negative boost for this field:
  title^2 content^1 negation^0.1

Does this seem like a reasonable approach? Any other ideas / suggestions / pointers?

thanks
ryan

Reply via email to