I think the best way to do this is to modify 
org.apache.lucene.index.memory.SynonymTokenFilter and apply that filter at 
index time.

If token.termBuffer() contains one of those characters (á, à, â, ä, ã, å), 
replace it with its equivalent ASCII character (a), then inject the new 
Token as a synonym, so that both the accented and the plain form are indexed.
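
To make the folding step concrete, here is a rough, standalone sketch in plain 
Java. It is not the Lucene filter itself: the class AccentFolder and its 
methods fold() and expand() are names I made up for illustration, and it 
assumes java.text.Normalizer is available (Java 6 or later). Inside a modified 
SynonymTokenFilter you would apply the same idea to the token text and emit 
the folded form as the extra token.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Lucene or Solr.
final class AccentFolder {

    // Decompose to NFD (base letter + combining marks), then drop the marks.
    static String fold(String term) {
        String decomposed = Normalizer.normalize(term, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    // Return the original term plus its folded form when the two differ,
    // mimicking "inject the folded token as a synonym".
    static List<String> expand(String term) {
        List<String> forms = new ArrayList<String>();
        forms.add(term);
        String folded = fold(term);
        if (!folded.equals(term)) {
            forms.add(folded);
        }
        return forms;
    }

    public static void main(String[] args) {
        // "Ti\u00EBsto" is Tiesto written with e-diaeresis.
        System.out.println(expand("Ti\u00EBsto")); // accented form, then Tiesto
        System.out.println(expand("Tiesto"));      // only Tiesto
    }
}
```

This only handles letters that Unicode decomposes into a base letter plus a 
combining mark (such as the a, e, i, o variants in the question). Letters like 
ø or ß have no such decomposition and would need an explicit mapping table.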

I don't know whether it is the best way, but it will give you what you want.

--- On Wed, 2/18/09, radarghost <radargh...@yahoo.com> wrote:

> From: radarghost <radargh...@yahoo.com>
> Subject: foreign characters equivalent in solr search
> To: solr-user@lucene.apache.org
> Date: Wednesday, February 18, 2009, 4:28 PM
> We are using Solr 1.2 and don't want to upgrade to 1.3 until
> the official release for Debian.
> I want Solr to search for the equivalent of a foreign character
> to get better results.
> 
> For example:
> 
> If a user searches for Tiesto, which is indexed in our Solr as
> Tiësto, we want Solr to return that result as well. Likewise, a
> search with a plain a should match words containing
> á, à, â, ä, ã, å; e should match ë, i should match ï,
> o should match ö, and so on.
> 
> Is there any solution?
> 
> I hope I could explain what I need with my poor English.
> 
> Thanks
> 
> 
> -- 
> View this message in context:
> http://www.nabble.com/foreign-characters-equivalent-in-solr-search-tp22079912p22079912.html
> Sent from the Solr - User mailing list archive at
> Nabble.com.
