On 4/21/10 3:22 PM, Robert Muir wrote:
On Wed, Apr 21, 2010 at 2:26 PM, Mark Miller <markrmil...@gmail.com> wrote:
It's an orthogonal issue - running will have that problem no matter what. It
doesn't affect whether a user that types running may be just as interested
in a doc that matches all of their other terms but has ran instead of
running. It's also just a simple example.
It's not orthogonal, e.g. "running water"
Stemming/lemmatization will pretty much always improve recall at the cost
of precision - that's nothing new. If you stem instead, are you going to
want documents that had run and water when you searched for running
water? I just don't see this point as an argument against lemmatization
and in favor of stemming.
- Mark
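
The recall/precision trade-off discussed above can be sketched with a toy index. This is only an illustration under stated assumptions: the NORMALIZE table is a hypothetical stand-in for whichever stemmer or lemmatizer is in use, the three documents are made up, and matching is plain AND over whitespace-split terms rather than anything Lucene actually does.

```python
# Toy illustration only: NORMALIZE is a hypothetical lookup table standing in
# for a real stemmer or lemmatizer; it is not any library's actual behaviour.
NORMALIZE = {"running": "run", "ran": "run", "runs": "run"}

# Made-up documents for the "running water" query from the thread.
DOCS = {
    1: "running water from the tap",
    2: "she ran to fetch water",
    3: "run the pump to move water",
}

def terms(text, normalize):
    """Lowercase, split on whitespace, optionally map words to a base form."""
    words = text.lower().split()
    if normalize:
        words = [NORMALIZE.get(w, w) for w in words]
    return set(words)

def search(query, normalize):
    """Return sorted ids of docs containing every query term (AND semantics)."""
    q = terms(query, normalize)
    return sorted(doc_id for doc_id, text in DOCS.items()
                  if q <= terms(text, normalize))

exact = search("running water", normalize=False)
normalized = search("running water", normalize=True)
print(exact, normalized)
```

Without normalization only the document containing the literal word "running" matches. With it, the documents containing "ran" and "run" match as well, which is the recall gain; whether those extra hits are wanted for a query like "running water" is the precision question raised in the thread.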