Trying to shoehorn business name resolution or correction purely into
Solr tokenization and spell checking is not, in my opinion, a viable
approach. It seems to me that you need a query parser that does
something very different from pure tokenization, and you might also
need a more complex approach to matching names. Full disclosure: I
work for a company that builds one of those. You could talk to us, or
you could at least look at the problem from the point of view of our
approach: take the business names, index them in some way that allows
for fuzzy matching (which is _not_ just treating them as ordinary
tokenized text), then take the queries, and map them to fuzzy
matching. The whole business is comparable to the geo support in Solr:
a special data type that is treated with domain-specific techniques.

Reply via email to