It would help if we had a clear description of what the analyzer is SUPPOSED
to do, since it seems to do a lot of stuff that may or may not make sense.
In other words, what is the original problem the analyzer is trying to
solve - in detail?
-- Jack Krupansky
-----Original Message-----
From: Ahmet Arslan
Sent: Monday, December 31, 2012 11:58 AM
To: solr-user@lucene.apache.org
Subject: Re: search with spaces
Hi Sangeetha,
Your field type definition is somewhat odd.
Since you use solr.LowerCaseTokenizerFactory, you don't need
LowerCaseFilterFactory after it.
Your tokenizer keeps only letters. Numbers and other characters do not
survive tokenization. It is a letter tokenizer under the hood.
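To make the letters-only behaviour concrete, here is a rough Python stand-in for that tokenizer. It is only a sketch: the function name is made up for illustration, and the regex approximates "runs of letters" rather than reproducing Lucene's exact character rules.

```python
import re


def lowercase_letter_tokenize(text):
    """Approximate a lowercasing letter tokenizer (illustrative, not Solr code)."""
    # [^\W\d_]+ matches runs of letters only: digits, underscores,
    # punctuation and whitespace all act as token separators.
    return [tok.lower() for tok in re.findall(r"[^\W\d_]+", text)]


# The digit "0" disappears, while the letter "O" survives as "o".
print(lowercase_letter_tokenize("Model 0 vs Model O"))
# ['model', 'vs', 'model', 'o']
```

So a query for "0" would have nothing left to match on after tokenization, whatever the later filters do.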
NGramFilterFactory combined with a stem filter may cause something like this:
"implementation" will produce the gram "men" at index time, and the stemmer
will convert it into "man", so the query "man" will match "implementation".
Just an example.
Just a guess: maybe one-character words cannot survive the ngram filter? Or do
you have O in stopwords.txt?
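The two n-gram points above can be sketched in a few lines of Python. This is a simplified model that assumes the filter emits every substring between minGramSize and maxGramSize and emits nothing for shorter tokens. The helper name is hypothetical, the stemmer is not modelled, and the real behaviour for short tokens should be confirmed on the analysis page for your Solr version.

```python
def ngrams(token, min_gram=2, max_gram=15):
    """Return all substrings of token with length in [min_gram, max_gram]."""
    grams = []
    for size in range(min_gram, max_gram + 1):
        # No start positions exist when the token is shorter than size.
        for start in range(len(token) - size + 1):
            grams.append(token[start:start + size])
    return grams


# "men" is one of the grams indexed for "implementation".
print("men" in ngrams("implementation"))  # True

# A one-character token is shorter than min_gram, so nothing is emitted.
print(ngrams("o"))  # []
```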
The analysis admin page shows all analysis phases step by step. You can enter
your example document and query there and see whether there is a match.
It will show you what happens to your "0" and "O".
Another factor is which query parser you are using (dismax, lucene, etc.).
The analysis page does not perform actual query parsing, so the output of
&debugQuery=on is critical.
Try setting &q.op=OR (for the lucene query parser) or &mm=1 (for the dismax
query parser).
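For reference, the request parameters mentioned above could be assembled like this. It is only a sketch: the base URL is a placeholder for your own Solr endpoint, and the helper name is made up. Only debugQuery, q.op, defType and mm are actual Solr parameters.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; replace with the select URL of your own Solr instance.
BASE_URL = "http://localhost:8983/solr/select"


def debug_url(query, parser="lucene"):
    """Build a select URL with debugging on and lenient term matching."""
    params = {"q": query, "debugQuery": "on"}
    if parser == "lucene":
        params["q.op"] = "OR"  # any single term may match
    else:
        params["defType"] = "dismax"
        params["mm"] = "1"  # at least one clause must match
    return BASE_URL + "?" + urlencode(params)


print(debug_url("model O"))
print(debug_url("model O", parser="dismax"))
```

The parsed query in the debug section of the response shows which terms were actually searched after analysis.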
--- On Mon, 12/31/12, Sangeetha <sangeetha...@gmail.com> wrote:
From: Sangeetha <sangeetha...@gmail.com>
Subject: Re: search with spaces
To: solr-user@lucene.apache.org
Date: Monday, December 31, 2012, 7:55 AM
Hi,
I am not using a custom Solr plugin.
I am not getting any results, and there is no exception.
Here is my fieldType definition:
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.LowerCaseTokenizerFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt" />
    <filter class="solr.NGramFilterFactory" minGramSize="2" maxGramSize="15" side="front" />
    <filter class="solr.NGramFilterFactory" minGramSize="2" maxGramSize="15" side="back" />
    <filter class="solr.PorterStemFilterFactory" />
    <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.LowerCaseTokenizerFactory" />
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true" />
    <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" />
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt" />
    <filter class="solr.PorterStemFilterFactory" />
    <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
  </analyzer>
</fieldType>
Thanks,
Sangeetha
--
View this message in context:
http://lucene.472066.n3.nabble.com/search-with-spaces-tp4029265p4029755.html
Sent from the Solr - User mailing list archive at Nabble.com.