Tomcat needs a flag that tells it to use UTF-8. If you don't set that
various problems happen, including this one. Look on the solr wiki for
Tomcat and UTF-8.

Also, there can't be any blank lines at the top of the XML file before
the XML header.

Can you post a very short XML file that has this problem?

On Mon, Dec 27, 2010 at 5:43 AM, Erick Erickson <erickerick...@gmail.com> wrote:
> This often happens if there is some character at the very beginning
> of the XML document, outside of any tags, here:
>
> character ''' (code 39) in prolog; expected '<'
>  at [row,col {unknown-source}]: [1,1]
>
> But you indicate that this is happening for every document? If that's
> the case, it may be an encoding issue. Make sure your servlet container
> character encoding handles the character set in your
> documents (UTF-8?).
>
> If that doesn't help, please show us an example of a file that doesn't work,
> the version of Solr you're using, info about your servlet, etc.
>
> Best
> Erick
>
> On Sun, Dec 26, 2010 at 10:24 PM, xu cheng <xcheng....@gmail.com> wrote:
>
>> hi all:
>>  I use solr to index my documents, and I put my text in a cdata
>> segment.however, solr always throws an exception complaining about
>> thexml file processing
>> .
>>  It seems that I can still index the document successfully!!!(actually ,
>> I'm
>> not sure about cos there are pretty too many document!)
>>
>>
>> the exception stack is like this: and all the exception infos are the same
>>
>>
>>
>>
>>  Error processing "legacy" update
>> command:com.ctc.wstx.exc.WstxUnexpectedCharException: Une
>> xpected character ''' (code 39) in prolog; expected '<'
>>  at [row,col {unknown-source}]: [1,1]
>>        at
>> com.ctc.wstx.sr.StreamScanner.throwUnexpectedChar(StreamScanner.java:648)
>>        at
>>
>> com.ctc.wstx.sr.BasicStreamReader.nextFromProlog(BasicStreamReader.java:2047)
>>        at
>> com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1069)
>>        at
>> org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:90)
>>        at
>>
>> org.apache.solr.handler.XmlUpdateRequestHandler.doLegacyUpdate(XmlUpdateRequestHandle
>> r.java:130)
>>        at
>> org.apache.solr.servlet.SolrUpdateServlet.doPost(SolrUpdateServlet.java:79)
>>        at javax.servlet.http.HttpServlet.service(HttpServlet.java:637)
>>        at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
>>        at
>>
>> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterCha
>> in.java:290)
>>        at
>>
>> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:
>> 206)
>>        at
>>
>> org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:286)
>>        at
>>
>> org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterCha
>> in.java:235)
>>        at
>>
>> org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:
>> 206)
>>        at
>>
>> org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
>>        at
>>
>> org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
>>        at
>>
>> org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
>>        at
>>
>> org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
>>        at
>>
>> org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
>>        at
>> org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:298)
>>        at
>> org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:857)
>>        at
>>
>> org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protoco
>> l.java:588)
>>        at
>> org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
>>        at java.lang.Thread.run(Thread.java:619)
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> any suggestion and reference are appreciated! thanks
>>
>



-- 
Lance Norskog
goks...@gmail.com

Reply via email to