Hi, I tested the new filters' configuration and it works fine. <fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="0"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> <filter class="solr.ISOLatin1AccentFilterFactory" /> </analyzer> <analyzer type="query"> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="1" catenateNumbers="1" catenateAll="0" splitOnCaseChange="0"/> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> <filter class="solr.ISOLatin1AccentFilterFactory" /> </analyzer> </fieldType>
The problem about ISOLatin1AccentFilterFactory was not due to Solr, but to a core-dependent configuration in a Solr multi-core environment. It was only necessary to set to 0 the property 'splitOnCaseChange' in solr.WordDelimiterFilterFactory. Thanks for your support, Marco Marco Branca Consultant Sytel Reply S.r.l. Via Ripamonti, 89 - 20139 Milano Mobile: (+39) 348 2298186 e-mail: m.bra...@reply.it Website: www.reply.eu ________________________________________ Da: Ensdorf Ken [ensd...@zoominfo.com] Inviato: venerdì 22 maggio 2009 18.16 A: 'solr-user@lucene.apache.org' Oggetto: RE: Filtering query terms > When I try testing the filter "solr.LowerCaseFilterFactory" I get > different results calling the following urls: > > 1. http://[server-ip]:[server-port]/solr/[core- > name]/select/?q=all%3Apapa&version=2.2&start=0&rows=10&indent=on > 2. http://[server-ip]:[server-port]/solr/[core- > name]/select/?q=all%3APaPa&version=2.2&start=0&rows=10&indent=on In this case, the WordDelimiterFilterFactory is kicking in on your second search, so "APaPa" is split into "APa" and "Pa". You can double-check this by using the analysis tool in the admin UI - http://localhost:8983/solr/admin/analysis.jsp > > Besides, when trying to test the "solr.ISOLatin1AccentFilterFactory" I > get different results calling the following urls: > > 1. http://[server-ip]:[server-port]/solr/[core- > name]/select/?q=all%3Apapa&version=2.2&start=0&rows=10&indent=on > 2. http://[server-ip]:[server-port]/solr/[core- > name]/select/?q=all%3Apapà &version=2.2&start=0&rows=10&indent=on Not sure what it happening here, but again I would check it with the analysi tool -- The information transmitted is intended for the person or entity to which it is addressed and may contain confidential and/or privileged material. Any review, retransmission, dissemination or other use of, or taking of any action in reliance upon, this information by persons or entities other than the intended recipient is prohibited. If you received this in error, please contact the sender and delete the material from any computer.