Tomcat needs a flag that tells it to use UTF-8. If you don't set that
various problems happen, including this one. Look on the solr wiki for
Tomcat and UTF-8.
Also, there can't be any blank lines at the top of the XML file before
the XML header.
Can you post a very short XML file that has this pr
This often happens if there is some character at the very beginning
of the XML document, outside of any tags, here:
character ''' (code 39) in prolog; expected '<'
at [row,col {unknown-source}]: [1,1]
But you indicate that this is happening for every document? If that's
the case, it may be an en
hi all:
I use solr to index my documents, and I put my text in a cdata
segment.however, solr always throws an exception complaining about
thexml file processing
.
It seems that I can still index the document successfully!!!(actually , I'm
not sure about cos there are pretty too many document!)