Hi,
I have similar kind of problem where I want search for a words with spaces
in that. And I wanted to search by stripping all the spaces .
I have used following schema for that
<fieldType name="nospaces" class="solr.TextField"
autoGeneratePhraseQueries="true" >
<analyzer type="index">
<tokenizer class="solr.KeywordTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.PatternReplaceFilterFactory"
pattern="[^\w]+" replacement="" replace="all"/>
</analyzer>
<analyzer type="query">
<tokenizer class="solr.KeywordTokenizerFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.PatternReplaceFilterFactory"
pattern="[^\w]+" replacement="" replace="all"/>
</analyzer>
</fieldType>
And
<field name="text_nospaces" type="nospaces" indexed="true" stored="true"
omitNorms="true" />
<copyField source="text" dest="text_nospaces" />
But it is not searching the right terms . we are stripping the spaces and
indexing lowercase values when we do that.
Like : East Enders
when I seach for 'east end ers' text, its not returning any values saying
no document found.
I realised the solr uses QueryParser before passing query string to the
QueryAnalyzer in defined in schema.
And The Query parser is tokenizing the query string providing in query . So
it is sending each token to the QueryAnalyser that is defined in schema.
SO is there anyway that I can by pass this query parser or use a correct
query processor which can consider the entire string as single pharse.
At the moment I am using dismax query processor.
Any suggestion would be much appreciated.
Thanks
Srinivasa
--
View this message in context:
http://lucene.472066.n3.nabble.com/KeywordTokenizerFactory-trouble-with-exact-matches-tp4114193p4114432.html
Sent from the Solr - User mailing list archive at Nabble.com.