Hi,

Are there any known good tools or approaches to "collapsing queries".
For example, imagine 4 original queries:
* big house
* big houses
* the big house
* bigger house

...and all 4 being reduced/collapsed to just "big house".

What might be some good approached for doing this?
1) stem them all and collapse if the are identical
2) compute levenstein distance and collapse if they are close enough

Maybe also remove stop words from them first? (not so good for queries
consisting of all or lots of stop words, like "to be or not to be")

Any better approaches?

Thanks,
Otis
--
Solr & ElasticSearch Support -- http://sematext.com/
Performance Monitoring -- http://sematext.com/spm

Reply via email to