Re: exception with xml file processing

2010-12-27 Thread Lance Norskog
Tomcat needs a flag that tells it to use UTF-8. If you don't set it, various problems happen, including this one. Look on the Solr wiki for Tomcat and UTF-8. Also, there can't be any blank lines at the top of the XML file before the XML header. Can you post a very short XML file that has this pr

Re: exception with xml file processing

2010-12-27 Thread Erick Erickson
This often happens if there is some character at the very beginning of the XML document, outside of any tags. Here: character ''' (code 39) in prolog; expected '<' at [row,col {unknown-source}]: [1,1]. But you indicate that this is happening for every document? If that's the case, it may be an en
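
One quick way to check for the situation both replies describe (something sitting in front of the first '<') is to look at the raw bytes before posting. The following is only a rough sketch, not anything from Solr itself: the function name leading_junk is made up for illustration, and it assumes the payload is UTF-8 or ASCII, where a UTF-8 byte order mark is typically tolerated by parsers and so is skipped.

```python
def leading_junk(data: bytes) -> bytes:
    """Return whatever bytes precede the first '<' in an XML payload.

    Hypothetical helper for illustration; assumes UTF-8/ASCII input.
    An empty result means the document starts directly with a tag
    or with the XML declaration.
    """
    # Skip a UTF-8 byte order mark, which XML parsers typically accept.
    if data.startswith(b"\xef\xbb\xbf"):
        data = data[3:]
    idx = data.find(b"<")
    # If there is no '<' at all, the whole payload counts as junk.
    return data if idx == -1 else data[:idx]


if __name__ == "__main__":
    # A stray apostrophe (code 39) at [1,1], as in the error above.
    print(repr(leading_junk(b"'<add><doc/></add>")))
    # Blank lines before the XML header.
    print(repr(leading_junk(b"\n\n<?xml version='1.0'?><add/>")))
```

If the result is non-empty for every document, the stray bytes are probably being added by whatever builds or sends the request (for example, shell quoting) rather than by the files themselves.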