Hi, you should curate your data; that is fundamental to a healthy search solution. But let's see what you can do anyway:
1) Curate a dictionary of such bad words and then configure the analysis chain to skip them (for example with a stop-word filter).
2) Have you tried different dictionary implementations? I would assume that each misspelled word has a low document frequency. You could use the HighFrequencyDictionaryFactory [1] and see how it goes.

[1] https://lucene.apache.org/solr/guide/7_3/suggester.html#highfrequencydictionaryfactory

-----
Alessandro Benedetti
Search Consultant, R&D Software Engineer, Director
Sease Ltd. - www.sease.io
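
P.S. To make the idea behind point 2 concrete, here is a minimal Python sketch of document-frequency filtering. It is not Solr's implementation, only an illustration of the principle: a term is kept in the dictionary only if it appears in at least a given fraction of the documents, so a misspelling that shows up in a single document is dropped. The function name `high_frequency_terms`, the whitespace tokenization, and the sample documents are all made up for the example.

```python
from collections import Counter


def high_frequency_terms(docs, threshold):
    """Return the terms that occur in at least `threshold` (0..1) of the docs.

    Illustrative only: naive lowercase/whitespace tokenization, not Solr code.
    """
    doc_freq = Counter()
    for doc in docs:
        # Count each term once per document (document frequency,
        # not raw term frequency).
        doc_freq.update(set(doc.lower().split()))
    min_docs = threshold * len(docs)
    return {term for term, df in doc_freq.items() if df >= min_docs}


# Hypothetical sample data: "serach" is a misspelling seen in one document.
docs = [
    "solr search engine",
    "solr suggester config",
    "solr serach engine",
    "search relevance tuning",
]

print(sorted(high_frequency_terms(docs, threshold=0.5)))
# prints ['engine', 'search', 'solr']
```

With a threshold of 0.5 a term must occur in at least 2 of the 4 documents, so "serach" (1 document) is excluded. The trade-off is that rare but correct terms ("suggester", "relevance") are excluded too, so the threshold needs tuning against your own data.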