Hi,

I'm running Solr 1.3 on Tomcat 5.5.
When I index some data and then query for all documents, my accented characters
and UTF-8 encoding are not taken into account:
<doc>
<date name="created">2006-12-14T15:28:27Z</date>
<str name="description_ja">
Le 1er film de Goro Miyazaki (fils de Hayao)
<br />je suis allée  ...
....
<str name="title_ja">渡邊 前川 vs 三田下田 1</str>


My MySQL database is correctly set to UTF-8: if I query the data directly from
MySQL, I get the accents and even the Japanese characters properly.

I index my data with the following data-config:
  <dataSource type="JdbcDataSource"
              driver="com.mysql.jdbc.Driver"
              url="jdbc:mysql://master-spare.videos.com/videos"
              user="solr"
              password="pass"
              batchSize="-1"
              responseBuffering="adaptive"/>
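
One thing that may be worth trying here, as an assumption rather than a confirmed fix: MySQL Connector/J accepts `useUnicode` and `characterEncoding` as connection properties in the JDBC URL, and an `&` inside an XML attribute has to be written as `&amp;`. A minimal Python sketch of building such a URL for the data-config (the helper names are hypothetical):

```python
from xml.sax.saxutils import escape

def with_utf8_params(jdbc_url):
    """Append Connector/J encoding properties to a JDBC URL (hypothetical helper)."""
    separator = "&" if "?" in jdbc_url else "?"
    return jdbc_url + separator + "useUnicode=true&characterEncoding=UTF-8"

def as_xml_attribute_value(value):
    """Escape &, < and > so the value can sit inside a double-quoted XML attribute."""
    return escape(value)

url = with_utf8_params("jdbc:mysql://master-spare.videos.com/videos")
# Prints the URL with "&" written as "&amp;", ready for the url="..." attribute.
print(as_xml_attribute_value(url))
```

Whether the connection encoding is actually the culprit is not established by anything above; this only shows what the attribute value would look like.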

My schema config file starts with: <?xml version="1.0" encoding="UTF-8" ?>

I've added this to my server.xml, since my localhost listens on port 8180:
    <Connector port="8180" maxHttpHeaderSize="8192"
               maxThreads="150" minSpareThreads="25" maxSpareThreads="75"
               enableLookups="false" redirectPort="8443" acceptCount="100"
               connectionTimeout="20000" disableUploadTimeout="true"
               URIEncoding="UTF-8" useBodyEncodingForURI="true" />
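
For context, URIEncoding controls how Tomcat decodes percent-escaped bytes in the request URI, so it matters for query strings sent to Solr rather than for rows pulled over JDBC. A small Python sketch of the difference, using the standard `urllib.parse.unquote` on a made-up query value (the second decoding mimics a connector that reads the URI as ISO-8859-1):

```python
from urllib.parse import unquote

# "allée" percent-encoded as UTF-8 bytes (illustrative value, not from a real request).
raw_query_value = "all%C3%A9e"

# Same bytes, two different assumptions about the charset.
decoded_as_utf8 = unquote(raw_query_value, encoding="utf-8")
decoded_as_latin1 = unquote(raw_query_value, encoding="latin-1")

print(decoded_as_utf8)    # allée
print(decoded_as_latin1)  # allÃ©e
```

If the stored field values themselves look like the second line, the damage probably happened before the request reached Tomcat, which this setting would not affect.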

What can I check?
I'm using a Linux server.
If I run dpkg-reconfigure -plow locales, I get:
Generating locales...
  fr_BE.UTF-8... up-to-date
  fr_CA.UTF-8... up-to-date
  fr_CH.UTF-8... up-to-date
  fr_FR.UTF-8... up-to-date
  fr_LU.UTF-8... up-to-date
Generation complete.

Could that be the problem? I would say no, but maybe I'm missing a package?
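
The locale output above only says that the UTF-8 locales are generated, not which locale the Tomcat process actually starts with, and older JVMs typically take their default charset from that locale. A rough Python sketch, following the POSIX precedence LC_ALL, then LC_CTYPE, then LANG (the function names are hypothetical), that could be run in the environment Tomcat is launched from:

```python
import os

def effective_ctype(env):
    """Return the locale name that governs character handling for this environment."""
    for name in ("LC_ALL", "LC_CTYPE", "LANG"):
        value = env.get(name)
        if value:  # unset or empty variables are skipped
            return value
    return "C"  # POSIX default when nothing is set

def is_utf8_locale(locale_name):
    """True if the codeset part of e.g. 'fr_FR.UTF-8' names UTF-8."""
    if "." not in locale_name:
        return False
    codeset = locale_name.split(".", 1)[1].split("@", 1)[0]
    return codeset.lower().replace("-", "") == "utf8"

current = effective_ctype(os.environ)
print(current, is_utf8_locale(current))
```

An init script can start Tomcat with a different environment than an interactive shell, so the result from a login session is only a hint.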



