Re: tomcat55/solr1.3 - Indexing data, doesnt take in consideration utf8!

2008-10-21 Thread sunnyfr
It actually comes from the MySQL database variables: | character_set_client | latin1 | | character_set_connection | latin1 | so I don't really know now how to configure my data

Re: tomcat55/solr1.3 - Indexing data, doesnt take in consideration utf8!

2008-10-21 Thread sunnyfr
Hi Jerome, I tried to chat with you on your website but you weren't there or ...?? lol. OK, I tried what you did, and this is what my file gives me back in gedit: 00ALL2006-10-10T05:29:32ZAll Japan Women's Pro-wrestling

WWWA Champion Title Match

豐田真奈美 VS 井上京子

Re: tomcat55/solr1.3 - Indexing data, doesnt take in consideration utf8!

2008-10-21 Thread Jérôme Etévé
Looks like you have a double encoding problem. It might be because you fetch UTF-8 binary data from MySQL (I know that, for instance, the Perl driver has an issue with that) and then encode it a second time in UTF-8 when you post to Solr. Make sure the strings you're getting from MySQL are actua
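
To illustrate the double encoding described above, here is a minimal Python sketch. It is not code from this thread: the sample string is the match title quoted earlier, and the helper name repair_double_encoding is hypothetical. It also assumes the wrong charset behaves like plain ISO-8859-1, which is a simplification of what MySQL calls latin1.

```python
# Sketch of a double-encoding round trip (illustrative only, not from the thread).
original = "豐田真奈美 VS 井上京子"

# Step 1: the database hands back correct UTF-8 bytes.
utf8_bytes = original.encode("utf-8")

# Step 2 (the bug): each byte is misread as one latin1 character...
misread = utf8_bytes.decode("latin-1")

# ...and that text is encoded as UTF-8 a second time before it is posted.
double_encoded = misread.encode("utf-8")


def repair_double_encoding(data: bytes) -> str:
    """Undo one extra latin1 -> UTF-8 round (hypothetical helper)."""
    return data.decode("utf-8").encode("latin-1").decode("utf-8")


repaired = repair_double_encoding(double_encoded)

print(repr(misread))          # mojibake instead of the Japanese names
print(repaired == original)   # the extra round can be reversed
```

If the indexed text looks like the mojibake printed above, the data was probably encoded twice somewhere between the database and the post to Solr. Repairing after the fact is a workaround; making the connection charset match the stored data avoids the second encoding in the first place.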