Hi,

I am experiencing problems using filters.

I'm using the following version of Solr:
          solr/nightly of 2009-04-12

The part of the schema.xml I'm using for setting filters is the following:

    <fieldType name="text" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" 
generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" 
splitOnCaseChange="1"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
        <filter class="solr.ISOLatin1AccentFilterFactory" />
      </analyzer>
      <analyzer type="query">
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" 
generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0" 
splitOnCaseChange="1"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
        <filter class="solr.ISOLatin1AccentFilterFactory" />
      </analyzer>
    </fieldType>

and the field I'm querying is a field called "all" declared as follows:

<field name="all" type="text" indexed="true" stored="true" multiValued="true"/>

When I try testing the filter "solr.LowerCaseFilterFactory" I get different 
results calling the following urls:

 1. 
http://[server-ip]:[server-port]/solr/[core-name]/select/?q=all%3Apapa&version=2.2&start=0&rows=10&indent=on
 2. 
http://[server-ip]:[server-port]/solr/[core-name]/select/?q=all%3APaPa&version=2.2&start=0&rows=10&indent=on

Besides, when trying to test the "solr.ISOLatin1AccentFilterFactory" I get 
different results calling the following urls:

 1. 
http://[server-ip]:[server-port]/solr/[core-name]/select/?q=all%3Apapa&version=2.2&start=0&rows=10&indent=on
 2. 
http://[server-ip]:[server-port]/solr/[core-name]/select/?q=all%3ApapĂ &version=2.2&start=0&rows=10&indent=on

Is it the expected behavior or it is a (known) bug? I would like to apply some 
filter converting all searched words in the corresponding lowercase version 
without accents.

Thanks for your help,

Marco


--
The information transmitted is intended for the person or entity to which it is 
addressed and may contain confidential and/or privileged material. Any review, 
retransmission, dissemination or other use of, or taking of any action in 
reliance upon, this information by persons or entities other than the intended 
recipient is prohibited. If you received this in error, please contact the 
sender and delete the material from any computer.

Reply via email to