Re: exception with xml file processing

2010-12-27 Thread Lance Norskog
Tomcat needs a flag that tells it to use UTF-8. If you don't set it, various problems happen, including this one. Look on the Solr wiki for Tomcat and UTF-8. Also, there can't be any blank lines at the top of the XML file before the XML header. Can you post a very short XML file that has this pr

Re: exception with xml file processing

2010-12-27 Thread Erick Erickson
This often happens if there is some character at the very beginning of the XML document, outside of any tags. Here: character ''' (code 39) in prolog; expected '<' at [row,col {unknown-source}]: [1,1]. But you indicate that this is happening for every document? If that's the case, it may be an en
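
One way to confirm whether a file really starts with a stray character is to look at its first byte before posting it. The following is only a minimal sketch in Python, not part of Solr: the function name check_xml_prolog is hypothetical, and the check is deliberately strict, since it flags anything at all (including a blank line or a space) that comes before the first '<', apart from a UTF-8 byte order mark.

```python
import codecs


def check_xml_prolog(data: bytes) -> str:
    """Return 'ok', or a short description of what precedes the first '<'.

    Hypothetical helper for illustration only; it inspects raw bytes and
    does not parse the XML.
    """
    # Skip a UTF-8 byte order mark if one is present.
    if data.startswith(codecs.BOM_UTF8):
        data = data[len(codecs.BOM_UTF8):]
    if not data:
        return "empty document"
    if data[:1] == b"<":
        return "ok"
    # Report the offending first byte, in the spirit of "(code 39)".
    return f"unexpected byte {data[0]} ({chr(data[0])!r}) before '<'"


if __name__ == "__main__":
    # A document that starts with a single quote, as in the error above.
    print(check_xml_prolog(b"'<add><doc/></add>"))
    # A document that starts directly with the XML header.
    print(check_xml_prolog(b"<?xml version='1.0' encoding='UTF-8'?><add/>"))
```

To check a file on disk, read it in binary mode and pass the bytes in, for example check_xml_prolog(open(path, "rb").read()), where path is whatever file is being posted.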

exception with xml file processing

2010-12-26 Thread xu cheng
Hi all: I use Solr to index my documents, and I put my text in a CDATA section. However, Solr always throws an exception complaining about the XML file processing. It seems that I can still index the documents successfully! (Actually, I'm not sure about that, because there are quite a lot of documents!)