The Word Break spellchecker will incorporate the broken & combined words in the 
collations.  Its designed to work seamlessly in conjunction with a "regular" 
spellchecker (IndexBased- or Direct-).  

James Dyer
Ingram Content Group
(615) 213-4311


-----Original Message-----
From: Nicholas Ding [mailto:nicholas...@gmail.com] 
Sent: Monday, May 13, 2013 12:07 PM
To: solr-user@lucene.apache.org
Subject: Re: Looking for Best Practice of Spellchecker

Thank you for you help, guys. I agreed, "wall mart" should be a synonyms,
it's not a good example.

I did an experiment by using KeywordTokenizer + DirectSolrSpellChecker, I
can get suggestion even for "wall mart" to "walmart". But I don't know
whether it's a good practice or not. It's much like a workaround to me. And
for WordBreakSpellChecker, I haven't tried it yet. Does this spellchecker
break the word and concatenate them then give me collations?

Thanks


On Fri, May 10, 2013 at 11:34 AM, Dyer, James
<james.d...@ingramcontent.com>wrote:

> Good point, Jason.  In fact, even if you use WorkBreakSpellChecker "wall
> mart" will not correct to "walmart".  The reason is the spellchecker cannot
> both correct a token's spelling *and* fix the wordbreak issue involving
> that same token.  So in this case a synonym is the way to go.
>
> James Dyer
> Ingram Content Group
> (615) 213-4311
>
>
> -----Original Message-----
> From: Jason Hellman [mailto:jhell...@innoventsolutions.com]
> Sent: Friday, May 10, 2013 9:55 AM
> To: solr-user@lucene.apache.org
> Subject: Re: Looking for Best Practice of Spellchecker
>
> Nicholas,
>
> Also consider that some misspellings are better handled through Synonyms
> (or injected metadata).
>
> You can garner a great deal of value out of the spell checker by following
> the great advice James is giving here...but you'll find a well-placed
> "helper" synonym or metavalue can often save a lot of headache and time.
>
> Jason
>
> On May 10, 2013, at 7:32 AM, "Dyer, James" <james.d...@ingramcontent.com>
> wrote:
>
> > Nicholas,
> >
> > It sounds like you might want to use WordBreakSolrSpellChecker, which
> gets obscure mention in the wiki.  Read through this section:
> http://wiki.apache.org/solr/SpellCheckComponent#Configuration and you
> will see some information.
> >
> > Also, the Solr Example shows how to configure this.  See
> http://svn.apache.org/repos/asf/lucene/dev/branches/branch_4x/solr/example/solr/collection1/conf/solrconfig.xml
> >
> > Look for...
> >
> > <lst name="spellchecker">
> >  <str name="name">wordbreak</str>
> >  ...
> > </lst>
> >
> > ...and...
> >
> > <requestHandler name="/spell" ...>
> > ...
> > </requestHandler>
> >
> > Also, I'd recommend you take a look at each parameter in the "/spell"
> request handler and read its section on the "spellcheckcomponent" wiki
> page.  You probably will want to set many of these parameters as well.
> >
> > You can get a query to return only spell results simply by specifying
> "rows=0".  However, its one less query to just have it return the results
> also.  If there are no results, your application can check for collations
> and re-issue a collation query.  If there are both results and collations
> returned, you can give the user results with "did-you-mean" suggestions.
> >
> > James Dyer
> > Ingram Content Group
> > (615) 213-4311
> >
> >
> > -----Original Message-----
> > From: Nicholas Ding [mailto:nicholas...@gmail.com]
> > Sent: Friday, May 10, 2013 8:47 AM
> > To: solr-user@lucene.apache.org
> > Subject: Looking for Best Practice of Spellchecker
> >
> > Hi guys,
> >
> > I'm working on a local search project, I wanna integrate spellchecker for
> > the search.
> >
> > So basically, my search engines is used to search local businesses. For
> > example, user could search for "wall mart", here is a typo, I wanna
> > spellchecker to give me Collation for "walmart".
> >
> > My problems are:
> > 1. I use DirectSolrSpellChecker on my BusinessNameField and pass "wall
> > mart" as phrase search, but I can't get collation from the spellchecker.
> > 2. I tried not to pass phrase search, but pass q=Wall AND Mart to force a
> > 100% match, but spellchecker can't give me collation also.
> >
> > I read the documents about spellchecker on Solr wiki, but it's very
> brief.
> > I'm wondering is there any best practice of spellchecker, I believe it's
> > widely used in the search, right?
> >
> > And I have another idea, I don't know whether it's valid or not. I want
> to
> > apply spellchecker everything before doing the search, so that I could
> rely
> > on the spellchecker to tell me whether my search could get result or not.
> >
> > Thanks
> > Nicholas
> >
>
>
>
>

Reply via email to