Re: Issue with fuzzy search in Distributed Search

Jack Krupansky Tue, 30 Apr 2013 08:00:08 -0700

A fuzzy query itself does not know about distributed search - Lucene simplyscores the query results based on the local index. Then, Solr is merging themerging the query results from different nodes.

Try the query locally for each node and set debugQuery=true and see how eachdocument gets scored.

I'm actually not sure what the specific "problem" (symptom) is that you areseeing. I mean, maybe there is only 1 result on that node - how do you knowotherwise?? Or maybe one node has more exact matches.


-- Jack Krupansky

-----Original Message-----From: meghana

Sent: Tuesday, April 30, 2013 7:51 AM
To: solr-user@lucene.apache.org
Subject: Issue with fuzzy search in Distributed Search

I have created 2 versions of Solr core in different servers. one is simple
core having all records in one core. And other is shards core, distributed
over 3 cores on server.

Simple core :

http://localhost:8080/sorl/core0/select?q=text:hoers~1

Distributed core :

http://192.168.1.91:8080/core0/select?shards=http://192.168.1.91:8080/core0,http://192.168.1.91:8080/core1,http://192.168.1.91:8080/core2&q=text:hoers~1

data, schema and other configuration is similar in both the cores.

but while doing fuzzy search like hoers~1 one core returns many
records(about 450), while other core return only 1 record.

While this issue does not seem related to Distributed Search, as Although i
do not use distributed search, then also it do not return more rows.

as http://192.168.1.91:8080/core0/select?q=text:hoers~1

below is schema definition for my field.

<fieldType name="text_en_splitting" class="solr.TextField"
positionIncrementGap="100" autoGeneratePhraseQueries="true">
     <analyzer type="index">
     <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.StopFilterFactory"
               ignoreCase="true"
               words="stopwords.txt"
               enablePositionIncrements="false"
               />
       <filter class="solr.StopFilterFactory"
               ignoreCase="true"
               words="stopwords_en.txt"
               enablePositionIncrements="true"
               />
       <filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" generateNumberParts="1" catenateWords="1"
catenateNumbers="1" catenateAll="0" splitOnCaseChange="1"
protected="protwords.txt" types="wdfftypes.txt"  />
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.KeywordMarkerFilterFactory"
protected="protwords.txt"/>
       <filter class="solr.PorterStemFilterFactory"/>
     </analyzer>
     <analyzer type="query">
       <tokenizer class="solr.WhitespaceTokenizerFactory"/>
       <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"
ignoreCase="true" expand="true"/>
       <filter class="solr.StopFilterFactory"
               ignoreCase="true"
               words="stopwords_extra_query.txt"
               enablePositionIncrements="false"
               />
       <filter class="solr.StopFilterFactory"
               ignoreCase="true"
               words="stopwords_en.txt"
               enablePositionIncrements="true"
               />
       <filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" generateNumberParts="1" catenateWords="0"
catenateNumbers="0" catenateAll="0" splitOnCaseChange="1"
protected="protwords.txt" types="wdfftypes.txt"  />
       <filter class="solr.LowerCaseFilterFactory"/>
       <filter class="solr.KeywordMarkerFilterFactory"
protected="protwords.txt"/>
       <filter class="solr.PorterStemFilterFactory"/>
     </analyzer>
   </fieldType>

Not sure, what is wrong with this. Can anybody help me on this??




--

View this message in context:http://lucene.472066.n3.nabble.com/Issue-with-fuzzy-search-in-Distributed-Search-tp4060022.htmlSent from the Solr - User mailing list archive at Nabble.com.

Re: Issue with fuzzy search in Distributed Search

Reply via email to