Is that with Lucene or with Solr? Because Solr has several different spell-checker modules you can configure. I would recommend trying them first.
And, frankly, I still don't know what your business case is. Regards, Alex. ---- Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 1 October 2015 at 12:38, Mark Fenbers <mark.fenb...@noaa.gov> wrote: > Yes, and I've spend numerous hours configuring and reconfiguring, and > eventually even starting over, but still have not getting it to work right. > Even now, I'm getting bizarre results. For example, I query "NOTE: This > is purely as an example." and I get back really bizarre suggestions, like > "n ot e" and "n o te" and "n o t e" for the first word which isn't even > misspelled! The same goes for "purely" and "example" also! Moreover, I get > extended results showing the frequencies of these suggestions being over > 2600 occurrences, when I'm not even using an indexed spell checker. I'm > only using a file-based spell checker (/usr/shar/dict/words), and the > wordbreak checker. > > At this point, I can't even figure out how to narrow down my confusion so > that I can post concise questions to the group. But I'll get there > eventually, starting with removing the wordbreak checker for the time-being. > Your response was encouraging, at least. > > Mark > > > > On 10/1/2015 9:45 AM, Alexandre Rafalovitch wrote: >> >> Hi Mark, >> >> Have you gone through a Solr tutorial yet? If/when you do, you will >> see you don't need to code any of this. It is configured as part of >> the web-facing total offering which are tweaked by XML configuration >> files (or REST API calls). And most of the standard pipelines are >> already pre-configured, so you don't need to invent them from scratch. >> >> On your specific question, it would be better to ask what _business_ >> level functionality you are trying to achieve and see if Solr can help >> with that. Starting from Lucene code is less useful :-) >> >> Regards, >> Alex. >> ---- >> Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: >> http://www.solr-start.com/ >> >> >> On 1 October 2015 at 07:48, Mark Fenbers <mark.fenb...@noaa.gov> wrote: