Re: Normalizing multiple Chars with MappingCharFilter possible?

Koji Sekiguchi Tue, 24 Nov 2009 03:30:32 -0800

Andreas Kahl wrote:

Hello everyone,
is it possible to normalize strings like '`e' (2 chars) => 'e' (in contrast to 'é' (1 char) => 'e') with org.apache.lucene.analysis.MappingCharFilter?
I am asking because I am considering indexing some multilingual, multi-alphabet data with Solr, and that data uses such strings as substitutes for 'real' Unicode characters.
Thanks for your advice.
Andreas

Yes. It should work.
MappingCharFilter supports:

* char-to-char
* string-to-char
* char-to-string
* string-to-string

without misaligning the original offsets (i.e. the highlighter works
correctly with MappingCharFilter).
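
To make the offset point concrete, here is a rough, self-contained sketch in plain Java. It is not Lucene's code and does not use the Lucene API; the class MappingSketch, its Result type, and the sample rules are all made up for illustration. It assumes greedy longest-match replacement and records, for every output character, the position in the original text it came from, which is the kind of bookkeeping that lets a highlighter point back at the right span.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustration only: a hypothetical stand-in, NOT Lucene's MappingCharFilter.
final class MappingSketch {

    // Normalized text plus, per output char, the offset in the original input.
    static final class Result {
        final String text;
        final int[] originalOffsets;

        Result(String text, int[] originalOffsets) {
            this.text = text;
            this.originalOffsets = originalOffsets;
        }
    }

    // Hypothetical sample rules covering the mapping kinds listed above.
    static Map<String, String> sampleRules() {
        Map<String, String> rules = new LinkedHashMap<>();
        rules.put("`e", "e");      // string-to-char
        rules.put("\u00E9", "e");  // char-to-char (e with acute accent)
        rules.put("\u00DF", "ss"); // char-to-string (sharp s)
        rules.put("`ae", "ae");    // string-to-string
        return rules;
    }

    static Result normalize(String input, Map<String, String> rules) {
        StringBuilder out = new StringBuilder();
        List<Integer> offsets = new ArrayList<>();
        int i = 0;
        while (i < input.length()) {
            // Assumption: pick the longest rule key that matches at position i.
            String best = null;
            for (String key : rules.keySet()) {
                if (!key.isEmpty() && input.startsWith(key, i)
                        && (best == null || key.length() > best.length())) {
                    best = key;
                }
            }
            if (best == null) {
                // No rule applies: copy the char and remember where it was.
                out.append(input.charAt(i));
                offsets.add(i);
                i++;
            } else {
                // Every replacement char maps back to the start of the match.
                String replacement = rules.get(best);
                for (int k = 0; k < replacement.length(); k++) {
                    out.append(replacement.charAt(k));
                    offsets.add(i);
                }
                i += best.length();
            }
        }
        int[] arr = new int[offsets.size()];
        for (int k = 0; k < arr.length; k++) {
            arr[k] = offsets.get(k);
        }
        return new Result(out.toString(), arr);
    }

    public static void main(String[] args) {
        Result r = normalize("caf`e b", sampleRules());
        System.out.println(r.text + " " + java.util.Arrays.toString(r.originalOffsets));
    }
}
```

In this sketch "caf`e b" becomes "cafe b", and the trailing 'b' still maps back to offset 6 in the original text even though the normalized string is one char shorter. The real filter does its own offset correction internally, so treat this only as a picture of the idea.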

Koji

--
http://www.rondhuit.com/en/
