Thanks a lot Shawn!  You were right, I didn't think to look into Tomcat.  I 
enabled UTF8 in tomcat and everything works now, Thanks

Vincent Vu Nguyen


-----Original Message-----
From: Shawn Heisey [mailto:s...@elyograg.org] 
Sent: Wednesday, December 05, 2012 10:12 AM
To: solr-user@lucene.apache.org
Subject: Re: Turn on Unicode support

On 12/5/2012 7:31 AM, Nguyen, Vincent (CDC/OD/OADS) (CTR) wrote:
> Is there a way to turn on support for Unicode characters in version 1.4.1?  
> The strange thing is that my coworker and I are supposed to have the same 
> configuration, yet on her machine, there seems to be Unicode support enabled.
>
> For example, if I use the SOLR admin to do a search for the a term with the 
> 'Registered trademark ®' character, it will translate to 'â®'

Solr has full UTF8/Unicode support already.  It runs in a servlet container, 
like Jetty, Tomcat, etc.  The container must also be set up to use the UTF8 
character set.  For interaction with Solr, your browser must also be set up for 
UTF8, which in some cases may mean your OS needs to have its locale settings 
changed.

Whatever source of index data you are using must also provide UTF8 data to 
Solr.  For example, if your installation of Solr is using the dataimport 
handler, you must also set up your data source as UTF8.

Thanks,
Shawn

Reply via email to