Re: Character encoding problems

2014-08-07 Thread Chris Hostetter
It's not clear to me from any of the comments you've made in this thread wether you've ever confirmed *exactly* what you are getting back from solr, ignoring the PHP completley. (ie: you refer to "UTF-8 for all of the web pages" suggesting you are only looking at some web application which is

Re: Character encoding problems

2014-07-30 Thread Gulliver Smith
Thanks for all the replies - I should have made clear that the first thing I did was confirm that everything on the PHP side is UTF-8. The web pages, the input text, the input files etc. The browser confirms that the encoding is UTF-8 for all of the web pages, the response headers as inspected by t

Re: Character encoding problems

2014-07-29 Thread Paul Libbrecht
> If you are seeing " appelé au téléphone" in the browser, I would guess > that the data is being rendered in UTF-8 by your server and the content type > of the html is set to iso-8859-1 or not being set and your browser is > defaulting to iso-8859-1. > > You can force the encoding to utf-8

Re: Character encoding problems

2014-07-29 Thread François Schiettecatte
Hi If you are seeing " appelé au téléphone" in the browser, I would guess that the data is being rendered in UTF-8 by your server and the content type of the html is set to iso-8859-1 or not being set and your browser is defaulting to iso-8859-1. You can force the encoding to utf-8 in the

Re: Character encoding problems

2014-07-29 Thread Gulliver Smith
Thanks for the information about URIEncoding="UTF-8" in the tomcat conf file, but that doesn't answer my main concerns: - what is the character encoding of the text in the title_fr field? - is there any way to force it to be UTF-8? On Tue, Jul 29, 2014 at 8:35 AM, wrote: > Hi, > > If you use sol

Re: Character encoding problems

2014-07-29 Thread aurelien . mazoyer
Hi, If you use solr 4.8.1, you don't have to add URIEncoding="UTF-8" in the tomcat conf file anymore : https://wiki.apache.org/solr/SolrTomcat Regards, Aurélien MAZOYER On 29.07.2014 14:22, Gulliver Smith wrote: I have solr 4.8.1 under Tomcat 7 on Debian Linux. The connector in Tomcat's se

Character encoding problems

2014-07-29 Thread Gulliver Smith
I have solr 4.8.1 under Tomcat 7 on Debian Linux. The connector in Tomcat's server.xml has been changed to include character encoding UTF-8: I am posting to the server from PHP 5.5 curl. The extract POST was intercepted and confirmed that everything is being encode in UTF-8. However, the resp