Gert Brinkmann wrote:
<int name="distinct">57971</int> <!-- is this much? -->
It's a lot for a small index. The fuzzy query will enumerate all of those terms and calculate an edit distance against each one. It's not an insane amount of work, but it jibes with the slowness you see. Doing that roughly 60,000 times per query is not that fast.
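To make the cost concrete, here is a rough Python sketch of that enumerate-and-compare loop. It is not Lucene's code: the term list is made up, and `max_edits` is a simplification I'm assuming in place of Lucene's similarity threshold. The point is only that every distinct term gets its own edit-distance calculation.

```python
def edit_distance(a, b):
    """Classic Levenshtein distance, computed one row at a time."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def fuzzy_scan(query, terms, max_edits=2):
    """Compare the query against every term (hypothetical helper).

    Returns the matching terms and the number of comparisons made,
    which is always len(terms) because nothing is skipped.
    """
    matches = [t for t in terms if edit_distance(query, t) <= max_edits]
    return matches, len(terms)


# Made-up stand-in for the field's distinct terms.
terms = ["solr", "solar", "sold", "lucene", "lucent", "fuzzy", "fuzz"]
matches, comparisons = fuzzy_scan("solr", terms, max_edits=1)
```

With 57971 distinct terms in place of these seven, the same loop does 57971 distance calculations for a single fuzzy term.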

Unfortunately, without the prefix setting, FuzzyQueries are very slow with that many unique terms. Solr should definitely allow the prefix to be set. There was talk a couple of years back about changing the default prefix value in Lucene because it's so slow, but it didn't happen. The developers decided that you could tweak it yourself if you needed to be able to scale (if you add a prefix length, the first characters up to that length must match exactly and won't be fuzzy). Unfortunately, Solr hasn't exposed this option yet, to my knowledge.
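A small Python sketch of why the prefix helps, again with made-up terms and a hypothetical helper name rather than anything from Lucene itself. It assumes the terms are kept in sorted order, so the terms sharing a prefix form one contiguous run and everything else can be skipped before any edit distance is computed.

```python
from bisect import bisect_left


def candidates_with_prefix(query, sorted_terms, prefix_length):
    """Return only the terms that share the first prefix_length chars of query.

    With prefix_length == 0 every term is a candidate, which is the
    slow case described above.
    """
    prefix = query[:prefix_length]
    if not prefix:
        return list(sorted_terms)
    start = bisect_left(sorted_terms, prefix)  # jump to the first possible match
    out = []
    for term in sorted_terms[start:]:
        if not term.startswith(prefix):
            break  # sorted order: no later term can share the prefix
        out.append(term)
    return out


# Made-up stand-in for the field's distinct terms, kept sorted.
sorted_terms = sorted(["solr", "solar", "sold", "lucene",
                       "lucent", "fuzzy", "fuzz", "bolr"])

no_prefix = candidates_with_prefix("solr", sorted_terms, 0)
two_char_prefix = candidates_with_prefix("solr", sorted_terms, 2)
```

Here a prefix length of 2 cuts the candidates from eight terms to the three starting with "so". The trade-off is visible too: "bolr" is one edit away from "solr" but is never considered, because the prefix part isn't fuzzy.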

- Mark
