Generally, stemming filters are not supported when wildcards are present. Only a small subset of filters work with wildcards, such as the case conversion filters.

But, you stay that you are using the stemmer to remove diacritical marks... you can/should use ASCIIFoldingFilterFactory or MappingCharFilterFactory.

-- Jack Krupansky

-----Original Message----- From: Román González
Sent: Monday, May 5, 2014 7:00 AM
To: solr-user@lucene.apache.org
Subject: Wildcard malfunctioning

Hi all!



Sorry in advance if this question was posted but I were unable to find it
with search engines.



Filter SpanishLightStemFilterFactory is not working properly with wildcards
or I’m misunderstanding something. I have the field



  <field name="cultivo_es" type="text_es" indexed="true" stored="true" />



With this type:



   <fieldType name="text_es" class="solr.TextField"
positionIncrementGap="100">

     <analyzer>

       <tokenizer class="solr.StandardTokenizerFactory"/>

       <filter class="solr.LowerCaseFilterFactory"/>

       <filter class="solr.StopFilterFactory" ignoreCase="true"
words="lang/stopwords_es.txt" format="snowball" />

       <filter class="solr.SpanishLightStemFilterFactory"/>

       <!-- more aggressive: <filter
class="solr.SnowballPorterFilterFactory" language="Spanish"/> -->

     </analyzer>

   </fieldType>



But I’m getting these results:



q = cultivo_es:uva

Getting 50 correct results



q = cultivo_es:uva*

Getting the same 50 correct results



q = cultivo_es:naranja

Getting the 50 correct results of “naranja”



q = cultivo_es:naranja*

Getting the 0 results !!!!!!!!!



It works fine if I remove SpanishLightStemFilterFactory filter, but I need
it in order to filter diacritics according to Spanish rules.



Thank you!!



Reply via email to