Yes, and I've spend numerous hours configuring and reconfiguring, and eventually even starting over, but still have not getting it to work right. Even now, I'm getting bizarre results. For example, I query "NOTE: This is purely as an example." and I get back really bizarre suggestions, like "n ot e" and "n o te" and "n o t e" for the first word which isn't even misspelled! The same goes for "purely" and "example" also! Moreover, I get extended results showing the frequencies of these suggestions being over 2600 occurrences, when I'm not even using an indexed spell checker. I'm only using a file-based spell checker (/usr/shar/dict/words), and the wordbreak checker.

At this point, I can't even figure out how to narrow down my confusion so that I can post concise questions to the group. But I'll get there eventually, starting with removing the wordbreak checker for the time-being. Your response was encouraging, at least.

Mark


On 10/1/2015 9:45 AM, Alexandre Rafalovitch wrote:
Hi Mark,

Have you gone through a Solr tutorial yet? If/when you do, you will
see you don't need to code any of this. It is configured as part of
the web-facing total offering which are tweaked by XML configuration
files (or REST API calls). And most of the standard pipelines are
already pre-configured, so you don't need to invent them from scratch.

On your specific question, it would be better to ask what _business_
level functionality you are trying to achieve and see if Solr can help
with that. Starting from Lucene code is less useful :-)

Regards,
    Alex.
----
Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
http://www.solr-start.com/


On 1 October 2015 at 07:48, Mark Fenbers <mark.fenb...@noaa.gov> wrote:

Reply via email to