On 12/5/2012 7:31 AM, Nguyen, Vincent (CDC/OD/OADS) (CTR) wrote:
> Is there a way to turn on support for Unicode characters in version 1.4.1? The
> strange thing is that my coworker and I are supposed to have the same
> configuration, yet on her machine, there seems to be Unicode support enabled.
> For example, if I use the SOLR admin to do a search for a term with the
> 'Registered trademark ®' character, it will translate to 'â®'.

Solr has full UTF-8/Unicode support already. It runs in a servlet
container, like Jetty, Tomcat, etc. The container must also be set up
to use the UTF-8 character set. For interaction with Solr, your browser
must also be set up for UTF-8, which in some cases may mean your OS needs
to have its locale settings changed. If your coworker's machine differs
in any one of these places, that would explain the different results.
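
As a rough illustration of what goes wrong when one link in that chain is
not using UTF-8, here is a small Python sketch. It has nothing to do with
Solr's own code; simulate_mojibake is a made-up helper name, and Latin-1
is only my assumption for the "wrong" charset. The exact garbage
characters you see will depend on which charset is actually in play.

```python
from urllib.parse import quote, unquote


def simulate_mojibake(text: str, wrong_charset: str = "latin-1") -> str:
    """Encode text as UTF-8, then decode those bytes with the wrong charset.

    This mimics a client sending UTF-8 while the receiver assumes
    something else (Latin-1 here, purely as an assumption).
    """
    return text.encode("utf-8").decode(wrong_charset)


registered = "\u00ae"  # the registered trademark sign

# UTF-8 stores this character as two bytes (0xC2 0xAE), so a receiver that
# reads one byte per character ends up with two characters.
garbled = simulate_mojibake(registered)
print(repr(garbled))  # 'Â®'

# A query string sent over HTTP should carry the UTF-8 bytes percent-encoded.
# quote() uses UTF-8 by default.
encoded = quote(registered)
print(encoded)  # %C2%AE

# Decoding with UTF-8 on the other side round-trips cleanly.
print(unquote(encoded) == registered)  # True
```

If the search term comes back as two characters instead of one, that is
a strong hint that something between the browser and Solr is decoding
UTF-8 bytes with a single-byte charset.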
Whatever source of index data you are using must also provide UTF-8 data
to Solr. For example, if your installation of Solr is using the
dataimport handler, you must also set up your data source as UTF-8.
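
If you are not sure whether your source data really is UTF-8, a quick
check along these lines can help before you index it. This is only a
sketch in Python; is_valid_utf8 is a hypothetical helper, and the sample
strings are made up for the example.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# The same text encoded two ways (sample data, not from a real index).
good = "Acme\u00ae".encode("utf-8")    # ends with bytes 0xC2 0xAE
bad = "Acme\u00ae".encode("latin-1")   # ends with the lone byte 0xAE

print(is_valid_utf8(good))  # True
print(is_valid_utf8(bad))   # False
```

Keep in mind that a check like this only catches bytes that are not
valid UTF-8. Text that was already garbled and then saved as UTF-8 will
still pass.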
Thanks,
Shawn