Re: search with spaces

Jack Krupansky Mon, 31 Dec 2012 09:33:55 -0800

It would help if we had a clear description of what the analyzer is SUPPOSEDto do, since it seems to do a lot of stuff that may or may not make sense.In other words, what is the original problem the analzer is trying tosolve - in detail?


-- Jack Krupansky

-----Original Message-----From: Ahmet Arslan

Sent: Monday, December 31, 2012 11:58 AM
To: [email protected]
Subject: Re: search with spaces

hi Sangeetha,

Your field type definition is somehow weird.

Since you use solr.LowerCaseTokenizerFactory" you don't needLowerCaseFilterFactory after that.

Your tokenizer keeps only letters. Number and other thing do not surviveafter tokenization. It is letter tokenizer under the hood.


NGramFilterFactory combined with stem filter may cause something like that:

implementation => will produce men at index time. stemmer will convert itinto man. And query man will match implementation. Just an example.

Just a guess may be one length words cannot survive ngram filter? Or youhave O in stopwords.txt?

analysis admin page shows step by step all analysis phases. You can enteryour example document and query there and see if there is a match or not.It will show you what happens to your "0" "O"


Other factor is which query parser are you using? dismax lucene etc.

analysis page does not perform actual query parsing. output of&debugQuery=on is critical.

Try set setting &q.op=OR (for lucene query parser) or &mm=1 (dismax queryparser)




--- On Mon, 12/31/12, Sangeetha <[email protected]> wrote:

From: Sangeetha <[email protected]>
Subject: Re: search with spaces
To: [email protected]
Date: Monday, December 31, 2012, 7:55 AM
Hi,
I am not using custom solr plugin.

I am not getting the results. There is no exception.

Here is my fieldType definition,

 <fieldType name="text" class="solr.TextField"
positionIncrementGap="100">
         <analyzer
type="index">
            <tokenizer
class="solr.LowerCaseTokenizerFactory" />
            <filter
class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" />

            <filter
class="solr.LowerCaseFilterFactory" />
            <filter
class="solr.KeywordMarkerFilterFactory"
protected="protwords.txt" />
            <filter
class="solr.NGramFilterFactory" minGramSize="2"
maxGramSize="15" side="front"/>
            <filter
class="solr.NGramFilterFactory" minGramSize="2"
maxGramSize="15" side="back"/>
            <filter
class="solr.PorterStemFilterFactory" />
            <filter
class="solr.RemoveDuplicatesTokenFilterFactory" />
         </analyzer>
         <analyzer
type="query">
            <tokenizer
class="solr.LowerCaseTokenizerFactory" />
            <filter
class="solr.SynonymFilterFactory"
synonyms="synonyms.txt" ignoreCase="true" expand="true"
/>
            <filter
class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt" />

            <filter
class="solr.LowerCaseFilterFactory" />
            <filter
class="solr.KeywordMarkerFilterFactory"
protected="protwords.txt" />
            <filter
class="solr.PorterStemFilterFactory" />
            <filter
class="solr.RemoveDuplicatesTokenFilterFactory"/>
         </analyzer>
      </fieldType>

Thanks,
Sangeetha



--

View this message in context:http://lucene.472066.n3.nabble.com/search-with-spaces-tp4029265p4029755.html

Sent from the Solr - User mailing list archive at
Nabble.com.

Re: search with spaces

Reply via email to