Re: Wildcard malfunctioning

Jack Krupansky Mon, 05 May 2014 04:20:08 -0700

Generally, stemming filters are not supported when wildcards are present.Only a small subset of filters work with wildcards, such as the caseconversion filters.

But, you stay that you are using the stemmer to remove diacritical marks...you can/should use ASCIIFoldingFilterFactory or MappingCharFilterFactory.


-- Jack Krupansky

-----Original Message-----From: Román González

Sent: Monday, May 5, 2014 7:00 AM
To: [email protected]
Subject: Wildcard malfunctioning

Hi all!

Sorry in advance if this question was posted but I were unable to find it
with search engines.

Filter SpanishLightStemFilterFactory is not working properly with wildcards
or I’m misunderstanding something. I have the field

  <field name="cultivo_es" type="text_es" indexed="true" stored="true" />

With this type:

   <fieldType name="text_es" class="solr.TextField"
positionIncrementGap="100">

     <analyzer>

       <tokenizer class="solr.StandardTokenizerFactory"/>

       <filter class="solr.LowerCaseFilterFactory"/>

       <filter class="solr.StopFilterFactory" ignoreCase="true"
words="lang/stopwords_es.txt" format="snowball" />

       <filter class="solr.SpanishLightStemFilterFactory"/>

       <!-- more aggressive: <filter
class="solr.SnowballPorterFilterFactory" language="Spanish"/> -->

     </analyzer>

   </fieldType>

But I’m getting these results:

q = cultivo_es:uva

Getting 50 correct results

q = cultivo_es:uva*

Getting the same 50 correct results

q = cultivo_es:naranja

Getting the 50 correct results of “naranja”

q = cultivo_es:naranja*

Getting the 0 results !!!!!!!!!

It works fine if I remove SpanishLightStemFilterFactory filter, but I need
it in order to filter diacritics according to Spanish rules.

Thank you!!

Re: Wildcard malfunctioning

Reply via email to