Tomcat needs a flag that tells it to use UTF-8. If you don't set it,
various problems happen, including this one. Look on the Solr wiki for
Tomcat and UTF-8.
Also, there can't be any blank lines at the top of the XML file before
the XML header.
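
If it helps, here is a rough way to check a file before posting it. This is
only a sketch in Python, not anything that ships with Solr; the function name
find_prolog_problem is made up for illustration. It looks at the raw bytes and
reports whatever sits in front of the first '<'. It assumes that a leading
UTF-8 byte order mark is acceptable and skips it, which may not match what
your parser does.

```python
def find_prolog_problem(data: bytes):
    """Describe what precedes the first '<' in an XML payload.

    Returns None when the payload starts with '<' (optionally after a
    UTF-8 byte order mark, which this sketch assumes is acceptable).
    """
    bom = b"\xef\xbb\xbf"
    if data.startswith(bom):
        data = data[len(bom):]
    if not data:
        return "empty document"
    first = data[0:1]
    if first == b"<":
        return None
    if first.isspace():
        return "whitespace before the first '<'"
    # Report the offending byte in the same style as the parser error.
    return "unexpected character {} (code {})".format(chr(data[0]), data[0])


if __name__ == "__main__":
    samples = [
        b"<?xml version='1.0' encoding='UTF-8'?><add></add>",
        b"\n<?xml version='1.0' encoding='UTF-8'?><add></add>",
        b"'<add></add>'",
    ]
    for sample in samples:
        print(find_prolog_problem(sample))
```

Running it on the exact bytes you send to Solr (read the file with open(path,
"rb")) shows whether a blank line or a stray quote is getting in ahead of the
XML header.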
Can you post a very short XML file that has this pr
This often happens if there is some character at the very beginning
of the XML document, outside of any tags, here:
character ''' (code 39) in prolog; expected '<'
at [row,col {unknown-source}]: [1,1]
But you indicate that this is happening for every document? If that's
the case, it may be an en