Ahmet, SpellChecker seems to be the the exact thing that I need for fuzzy type search , how can I combine SpellChecker with something like edismax parser to make use of paramerters like pf,pf2 and pf3 . Is there any resource that you can point me to do that ?
Thanks. On Wed, Apr 2, 2014 at 9:12 PM, S.L <simpleliving...@gmail.com> wrote: > Thanks Ahmet, I would definitely look into this . I appreciate that. > > > On Wed, Apr 2, 2014 at 7:47 PM, Ahmet Arslan <iori...@yahoo.com> wrote: > >> Yes, it has spellcheck.collate parameter. I mean it has lots of >> parameters and with correct combination of parameters >> it can suggest "White Siberian Ginseng" from "Whte Sberia Ginsng" >> >> https://cwiki.apache.org/confluence/display/solr/Spell+Checking >> >> >> >> >> On Thursday, April 3, 2014 1:57 AM, "simpleliving...@gmail.com" < >> simpleliving...@gmail.com> wrote: >> Ahmet. >> >> Thanks I will look into this option . Does spellchecker support multiple >> word search terms? >> >> Sent from my HTC >> >> ----- Reply message ----- >> From: "Ahmet Arslan" <iori...@yahoo.com> >> To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org> >> Subject: eDismax parser and the mm parameter >> Date: Wed, Apr 2, 2014 10:53 AM >> >> Hi SL, >> >> Instead of fuzzy queries, can't you use spell checker? Generally Spell >> Checker (a.k.a did you mean) is a preferred tool for typos. >> >> Ahmet >> >> On Wednesday, April 2, 2014 4:13 PM, "simpleliving...@gmail.com" < >> simpleliving...@gmail.com> wrote: >> >> It only works for a single word search term and not multiple word search >> term. >> >> Sent from my HTC >> >> ----- Reply message ----- >> From: "William Bell" <billnb...@gmail.com> >> To: "solr-user@lucene.apache.org" <solr-user@lucene.apache.org> >> Subject: eDismax parser and the mm parameter >> Date: Wed, Apr 2, 2014 12:03 AM >> >> Fuzzy is provided use ~ >> >> >> On Mon, Mar 31, 2014 at 11:04 PM, S.L <simpleliving...@gmail.com> wrote: >> >> > Jack , >> > >> > Thanks a lot , I am now using the pf ,pf2 an pf3 and have gotten rid of >> > the mm parameter from my queries, however for the fuzzy phrase queries >> , I >> > am not sure how I would be able to leverage the Complex Query Parser >> there >> > is absolutely nothing out there that gives me any idea as to how to do >> that >> > . >> > >> > Why is fuzzy phrase search not provided by Solr OOB ? I am surprised >> > >> > Thanks. >> > >> > >> > On Mon, Mar 31, 2014 at 5:39 AM, Jack Krupansky < >> j...@basetechnology.com >> > >wrote: >> > >> > > The pf, pf2, and pf3 parameters should cover cases 1 and 2. Use >> q.op=OR >> > > (the default) and ignore the mm parameter. Give pf the highest boost, >> and >> > > boost pf3 higher than pf2. >> > > >> > > You could try using the complex phrase query parser for the third >> case. >> > > >> > > -- Jack Krupansky >> > > >> > > -----Original Message----- From: S.L >> > > Sent: Monday, March 31, 2014 12:08 AM >> > > To: solr-user@lucene.apache.org >> > > Subject: Re: eDismax parser and the mm parameter >> > > >> > > Thanks Jack , my use cases are as follows. >> > > >> > > >> > > 1. Search for "Ginseng" everything related to ginseng should show >> up. >> > > 2. Search For "White Siberian Ginseng" results with the whole phrase >> > > show up first followed by 2 words from the phrase followed by a >> single >> > > word >> > > in the phrase >> > > 3. Fuzzy Search "Whte Sberia Ginsng" (please note the typos here) >> > > documents with White Siberian Ginseng Should show up , this looks >> like >> > > the >> > > most complicated of all as Solr does not support fuzzy phrase >> searches >> > . >> > > (I >> > > have no solution for this yet). >> > > >> > > Thanks again! >> > > >> > > >> > > On Sun, Mar 30, 2014 at 11:21 PM, Jack Krupansky < >> > j...@basetechnology.com> >> > > wrote: >> > > >> > > The mm parameter is really only relevant when the default operator >> is OR >> > >> or explicit OR operators are used. >> > >> >> > >> Again: Please provide your use case examples and your expectations >> for >> > >> each use case. It really doesn't make a lot of sense to prematurely >> > focus >> > >> on a solution when you haven't clearly defined your use cases. >> > >> >> > >> -- Jack Krupansky >> > >> >> > >> -----Original Message----- From: S.L >> > >> Sent: Sunday, March 30, 2014 9:13 PM >> > >> To: solr-user@lucene.apache.org >> > >> Subject: Re: eDismax parser and the mm parameter >> > >> >> > >> Jack, >> > >> >> > >> I mis-stated the problem , I am not using the OR operator as default >> > >> now(now that I think about it it does not make sense to use the >> default >> > >> operator OR along with the mm parameter) , the reason I want to use >> pf >> > and >> > >> mm in conjunction is because of my understanding of the edismax >> parser >> > and >> > >> I have not looked into pf2 and pf3 parameters yet. >> > >> >> > >> I will state my understanding here below. >> > >> >> > >> Pf - Is used to boost the result score if the complete phrase >> matches. >> > >> mm <(less than) search term length would help limit the query results >> > to >> > >> a >> > >> certain number of better matches. >> > >> >> > >> With that being said would it make sense to have dynamic mm (set to >> the >> > >> length of search term - 1)? >> > >> >> > >> I also have a question around using a fuzzy search along with eDismax >> > >> parser , but I will ask that in a seperate post once I go thru that >> > aspect >> > >> of eDismax parser. >> > >> >> > >> Thanks again ! >> > >> >> > >> >> > >> >> > >> >> > >> >> > >> On Sun, Mar 30, 2014 at 6:44 PM, Jack Krupansky < >> > j...@basetechnology.com> >> > >> wrote: >> > >> >> > >> If you use pf, pf2, and pf3 and boost appropriately, the effects of >> mm >> > >> >> > >>> will be dwarfed. >> > >>> >> > >>> The general goal is to assure that the top documents really are the >> > best, >> > >>> not to necessarily limit the total document count. Focusing on the >> > latter >> > >>> could be a real waste of time. >> > >>> >> > >>> It's still not clear why or how you need or want to use OR as the >> > default >> > >>> operator - you still haven't given us a use case for that. >> > >>> >> > >>> To repeat: Give us a full set of use cases before taking this XY >> > Problem >> > >>> approach of pursuing a solution before the problem is understood. >> > >>> >> > >>> -- Jack Krupansky >> > >>> >> > >>> -----Original Message----- From: S.L >> > >>> Sent: Sunday, March 30, 2014 6:14 PM >> > >>> To: solr-user@lucene.apache.org >> > >>> Subject: Re: eDismax parser and the mm parameter >> > >>> >> > >>> Jacks Thanks Again, >> > >>> >> > >>> I am searching Chinese medicine documents , as the example I gave >> > >>> earlier >> > >>> a user can search for "Ginseng" or Siberian Ginseng or Red Siberian >> > >>> Ginseng >> > >>> , I certainly want to use pf parameter (which is not driven by mm >> > >>> parameter) , however for giving higher score to documents that have >> > more >> > >>> of >> > >>> the terms I want to use edismax now if I give a mm of 3 and the >> search >> > >>> term >> > >>> is of only length 1 (like "Ginseng") what does edisMax do ? >> > >>> >> > >>> >> > >>> On Sun, Mar 30, 2014 at 1:21 PM, Jack Krupansky < >> > j...@basetechnology.com >> > >>> > >> > >>> wrote: >> > >>> >> > >>> It still depends on >> > >