I think best way to do this is to modify org.apache.lucene.index.memory.SynonymTokenFilter and employ this filter index time.
if token.termBuffer() has one those (á, à, â, ä, ã, å) characters you will replace it with its equvalent ascii character (a). Then you will inject this new Token as a Synonym. I don't know is it the best way but it will give you what you want. --- On Wed, 2/18/09, radarghost <radargh...@yahoo.com> wrote: > From: radarghost <radargh...@yahoo.com> > Subject: foreign characters equivalent in solr search > To: solr-user@lucene.apache.org > Date: Wednesday, February 18, 2009, 4:28 PM > we are using solr 1.2 and dont want to upgrade to 1.3 till > official release > for Debian. > i want solr to search for equivalent of a foreign chracter > for getting > better results > > in example: > > if a user searches for Tiesto which is indexed in this > format Tiësto in our > solr. we want solr also return result > return search result for á, à, â, ä, ã, å where they > are in word but that > word has been searched with normal a > e for ë, i for ï, o for ö, and so on > > any solution? > > hope i could tell what i need with my poor English > > thanks > > > -- > View this message in context: > http://www.nabble.com/foreign-characters-equivalent-in-solr-search-tp22079912p22079912.html > Sent from the Solr - User mailing list archive at > Nabble.com.