Hi,

When you run your indexing app make sure you treat what you send to Solr as
UTF-8.
Use -Dfile.encoding=UTF8 -Dclient.encoding.override=UTF-8 to the Java
command line.

Otis
--
Solr & ElasticSearch Support
http://sematext.com/





On Mon, Jan 21, 2013 at 3:06 PM, Jack Park <jackp...@topicquests.org> wrote:

> Here is a situation I now experience:
>
>                         What Solr has:
>                                 economist and thus …@en
>                         What was sent:
>                                 economist and thus …@en
> where those are just snippets from what I sent up -- the ellipsis was
> created by Carrot2, and what comes back when I fetch the document with
> that passage.
>
> There is a hint in the Solr FAQ that the server must support UTF-8;
> it's not clear how to do that from HTTPSolrServer.
> Other hints from around the web suggest I should be using a different
> field than type = "string"
>
> I should point out that I am running these developmental tests on the
> Solr 4 example build with my schema.xml.
>
> My question is this: what simple, say, utility call would return the
> text to its original?
> (perhaps that's the wrong question...)
>
> Many thank in advance
> Jack
>

Reply via email to