Re: Update with non UTF-8 characters

2014-10-01 Thread Chris Hostetter
: I am indexing Solr 4.9.0 using the /update request handler and am getting : errors from Tika - Illegal IOException from : org.apache.tika.parser.xml.DcXMLParser@74ce3bea which is caused by : MalFormedByteSequenceException: Invalid byte 1 of 1-byte UTF-8 sequence. I FWIW: that error appears to h

Update with non UTF-8 characters

2014-10-01 Thread Teague James
Hello! I am indexing Solr 4.9.0 using the /update request handler and am getting errors from Tika - Illegal IOException from org.apache.tika.parser.xml.DcXMLParser@74ce3bea which is caused by MalFormedByteSequenceException: Invalid byte 1 of 1-byte UTF-8 sequence. I believe that this is the result