I am trying to match the keywords with / without white space but one of the
case fails always..

For ex:

I am indexing 4 documents

name: wal mart
name: walmart
name: WalMart
name: Walmart

Now searching on name either using
wal mart
walmart
Walmart
WalMart

should return all the above 4 documents but searching using keyword 'wal
mart' returns only the first document and not the remaining 3 documents.

I am using shingle filter factory to create combination of the words during
indexing. Please find below my configuration. Can someone let me know where
I am wrong?

      <fieldType name="shingleString" class="solr.TextField"
omitNorms="true">
          <analyzer type="index">
             <charFilter class="solr.PatternReplaceCharFilterFactory"
              pattern="'+" replacement=""/>
              <tokenizer class="solr.WhitespaceTokenizerFactory"/>
              <filter class="solr.ASCIIFoldingFilterFactory"/>
              <filter class="solr.ShingleFilterFactory" minShingleSize="2"
              maxShingleSize="3" outputUnigrams="true"/>
              <filter class="solr.PatternReplaceFilterFactory" pattern="\W+"
              replacement=""/>
              <filter class="solr.LowerCaseFilterFactory"/>
          </analyzer>
         <analyzer type="query">
             <tokenizer class="solr.WhitespaceTokenizerFactory"/>
             <filter class="solr.ShingleFilterFactory" minShingleSize="2"
             maxShingleSize="99" outputUnigrams="true"/>
             <filter class="solr.PatternReplaceFilterFactory" pattern="\W+"
             replacement=""/>
             <filter class="solr.LowerCaseFilterFactory"/>
         </analyzer>
    </fieldType>
  </types>





--
View this message in context: 
http://lucene.472066.n3.nabble.com/SOLR-matching-keywords-with-without-whitespace-tp4082244.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to