1) The XML file must include the UTF-8 encoding metadata in the first line.
2) If you are using Tomcat: Tomcat comes without UTF-8 as the default.
The Solr wiki gives the directions on how to fix this.
3) If you are using Windows: Windows does not use UTF-8 by default.
Tracking down UTF-8 encoding problems almost drove me crazy :)
Good luck!
Darx Oman wrote:
Hi amier
try saving the xml file encoding as UTF-8
On Mon, Sep 6, 2010 at 11:08 AM, Darx Darx<darxo...@msn.com> wrote:
Date: Mon, 6 Sep 2010 10:10:25 +0500
Subject: How to enable Unicode Support in Solr
From: am...@techarete.com
To: solr-user@lucene.apache.org
I have an index that takes textual description and places it in the
index. I
am creating an XML file and passing it to Solr for indexing, but Solr is
not
saving Unicode characters as it is showing question mark for those
characters. I want to know that how to enable Unicode Support for Solr
index.
Regards,
Amier Haider