Re: Spell checking question from a Solr novice

2010-11-29 Thread Bill Dueber
On Mon, Oct 18, 2010 at 5:24 PM, Jason Blackerby wrote: > If you know the misspellings you could prevent them from being added to the > dictionary with a StopFilterFactory like so: > Or, you know, correct the data :-) -- Bill Dueber Library Systems Programmer University of Michigan Library

Re: Spell checking question from a Solr novice

2010-10-18 Thread Dennis Gearon
The first question to ask is will it work for you. The SECOND question is do you want google to know what's in your data? Dennis Gearon Signature Warning It is always a good idea to learn from your own mistakes. It is usually a better idea to learn from others’ mistakes, so yo

Re: Spell checking question from a Solr novice

2010-10-18 Thread Ezequiel Calderara
You can cross the new words against a dictionary and keep them in the file as Jason described... What Pradeep said is true, is always better to have "suggestions" related to your index that have suggestions with no results... On Mon, Oct 18, 2010 at 6:24 PM, Jason Blackerby wrote: > If you know

Re: Spell checking question from a Solr novice

2010-10-18 Thread Jason Blackerby
If you know the misspellings you could prevent them from being added to the dictionary with a StopFilterFactory like so: where misspelled_words.txt contains the misspellings. On Mon, Oct 18, 2010 at 5:14 PM, Pradeep Singh wrot

Re: Spell checking question from a Solr novice

2010-10-18 Thread Pradeep Singh
I think a spellchecker based on your index has clear advantages. You can spellcheck words specific to your domain which may not be available in an outside dictionary. You can always dump the list from wordnet to get a starter english dictionary. But then it also means that misspelled words from yo

Re: Spell checking question from a Solr novice

2010-10-18 Thread Jonathan Rochkind
In general, the benefit of the built-in Solr spellcheck is that it can use a dictionary based on your actual index. If you want to use some external API, you certainly can, in your actual client app -- but it doesn't really need to involve Solr at all anymore, does it? Is there any benefit I'

Re: Spell checking question from a Solr novice

2010-10-18 Thread Pradeep Singh
I haven't yet but I was going to use the spell checker in the lucene contrib module. That spellchecker is ngram based and previously I have noticed that I get better results from ngram based spellcheck rather than fuzzy string match based ones. On Mon, Oct 18, 2010 at 12:43 PM, Xin Li wrote: > H

RE: Spell checking question from a Solr novice

2010-10-18 Thread Xin Li
Oops, never mind. Just read Google API policy. 1000 queries per day limit & for non-commercial use only. -Original Message- From: Xin Li Sent: Monday, October 18, 2010 3:43 PM To: solr-user@lucene.apache.org Subject: Spell checking question from a Solr novice Hi, I am looking for a