On 8/1/2013 7:20 AM, Federico Chiacchiaretta wrote:
> on data import from a PostgreSQL db, I get the following error in solr.log:
> 
> ERROR - 2013-08-01 09:51:00.217; org.apache.solr.common.SolrException;
> shard update error RetryNode:
> http://172.16.201.173:8983/solr/archive/:org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException:
> Invalid
> UTF-8 character 0xfffe at char #416, byte #127)

It sounds like your database is not using the UTF-8 character set, but
the JDBC driver (or the driver-server combination) is not aware that the
character set is different.  Solr expects UTF-8.

Generally what you want to do is tell the JDBC driver to use the UTF-8
character set, which will hopefully cause either the driver or the DB
server to translate for you.

There is a charSet parameter for the postgresql jdbc driver:

http://jdbc.postgresql.org/documentation/80/connect.html

These are added to the jdbc URL after a ? character, just like
parameters on an http URL.

Thanks,
Shawn

Reply via email to