Here is a situation I now experience: What Solr has: economist and thus …@en What was sent: economist and thus …@en where those are just snippets from what I sent up -- the ellipsis was created by Carrot2, and what comes back when I fetch the document with that passage.
There is a hint in the Solr FAQ that the server must support UTF-8; it's not clear how to do that from HTTPSolrServer. Other hints from around the web suggest I should be using a different field than type = "string" I should point out that I am running these developmental tests on the Solr 4 example build with my schema.xml. My question is this: what simple, say, utility call would return the text to its original? (perhaps that's the wrong question...) Many thank in advance Jack