Hi,

I'm running Solr 1.3 on Tomcat 5.5. When I index some data and then query for all documents, my accented characters and UTF-8 encoding are clearly not taken into account:

<doc>
<date name="created">2006-12-14T15:28:27Z</date>
<str name="description_ja"> Le 1er film de Goro Miyazaki (fils de Hayao) <br />je suis allée ... ....
<str name="title_ja">渡邊 å‰å· vs 三田下田 1</str>
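For what it's worth, the garbled part of the title (å‰å·) looks like what you get when UTF-8 bytes are decoded with a single-byte Western charset somewhere between MySQL and Solr. Below is a minimal Java sketch of that effect, assuming ISO-8859-1 as the wrong charset. The class and method names are made up for illustration, and I don't know which charset is actually involved in my setup.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only (not part of Solr or Tomcat).
class MojibakeCheck {
    // Encode the text as UTF-8, then decode those bytes with the wrong charset.
    static String misdecode(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    // Reverse the mistake: get the raw bytes back, then decode them as UTF-8.
    static String repair(String garbled) {
        byte[] raw = garbled.getBytes(StandardCharsets.ISO_8859_1);
        return new String(raw, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String original = "all\u00e9e";       // "allée"
        String garbled = misdecode(original); // each UTF-8 byte becomes one char
        System.out.println(garbled.equals("all\u00c3\u00a9e"));
        System.out.println(repair(garbled).equals(original));
    }
}
```

If repair() turns a garbled value from the index back into the expected text, the bytes were most likely valid UTF-8 and were only decoded with the wrong charset at some step.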
My MySQL database is correctly set to UTF-8: if I query the data manually from MySQL, I get the accents and even the Japanese characters properly.

To index my data, my data-config is:

<dataSource type="JdbcDataSource" driver="com.mysql.jdbc.Driver" url="jdbc:mysql://master-spare.videos.com/videos" user="solr" password="pass" batchSize="-1" responseBuffering="adaptive"/>

My schema config file starts with:

<?xml version="1.0" encoding="UTF-8" ?>

Because my localhost points to port 8180, I've added this to my server.xml:

<Connector port="8180" maxHttpHeaderSize="8192" maxThreads="150" minSpareThreads="25" maxSpareThreads="75" enableLookups="false" redirectPort="8443" acceptCount="100" connectionTimeout="20000" disableUploadTimeout="true" URIEncoding="UTF-8" useBodyEncodingForURI="true" />

What can I check? I'm using a Linux server. If I run dpkg-reconfigure -plow locales, I get:

Generating locales...
fr_BE.UTF-8... up-to-date
fr_CA.UTF-8... up-to-date
fr_CH.UTF-8... up-to-date
fr_FR.UTF-8... up-to-date
fr_LU.UTF-8... up-to-date
Generation complete.

Could that be the problem? I would say no, but maybe. Am I missing a package?

-- 
View this message in context: http://www.nabble.com/tomcat55-solr1.3---Indexing-data%2C-doesnt-take-in-consideration-utf8%21-tp20086167p20086167.html
Sent from the Solr - User mailing list archive at Nabble.com.