I have solr 4.8.1 under Tomcat 7 on Debian Linux. The connector in Tomcat's server.xml has been changed to include character encoding UTF-8:
<Connector port="8080" protocol="HTTP/1.1" URIEncoding="UTF-8" connectionTimeout="20000" redirectPort="8443" /> I am posting to the server from PHP 5.5 curl. The extract POST was intercepted and confirmed that everything is being encode in UTF-8. However, the responses to query commands, whether XML or JSON are returning field values such as title_fr in something that looks like latin1 or iso-8859-1 when displayed in a browser or editor. E.g.: "title_fr":[" appelé au téléphone"] The highlights in the query response do have correctly displaying character codes. E.g. "text_fr":[" \n \n \n \n \n \n \n \n \n \n \nappelé au téléphone\nappelé au téléphone\n PHP's utf8_decode doesn't make sense of the title_fr. Is there something to configure to fix this and get proper UTF8 results for everything? Thanks Gulliver