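This is invalid XML. A bare "&" starts an entity reference, so the parser reads "&st" as an entity named 'st' and fails when it finds "=" instead of ";". Ampersands must be escaped as &amp; or the value wrapped in a CDATA section.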
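
As a rough illustration of both fixes, here is a minimal Python sketch. It does not use Solr or DIH; it only checks well-formedness with the standard library parser. The <doc> wrapper element and the parses() helper are assumptions made up for the example; only <id> and the URL come from your mail.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# URL from the original report; it contains bare ampersands.
raw_url = "http://www.example.com/?cp=30_s&st=a&c=655"


def parses(xml_text):
    """Hypothetical helper: True if xml_text is well-formed XML."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# 1. Raw URL dropped into the element: not well-formed.
broken = "<doc><id>" + raw_url + "</id></doc>"

# 2. Ampersands escaped as &amp; (escape() also handles < and >).
escaped = "<doc><id>" + escape(raw_url) + "</id></doc>"

# 3. Raw URL wrapped in a CDATA section (safe as long as the
#    value does not itself contain "]]>").
cdata = "<doc><id><![CDATA[" + raw_url + "]]></id></doc>"

print(parses(broken))
print(ET.fromstring(escaped).find("id").text)
print(ET.fromstring(cdata).find("id").text)
```

Either way the parser hands back the original URL unchanged, so the fix belongs in whatever generates the XML file, not in the DIH configuration.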

On Wednesday 13 April 2011 16:10:51 Rosa (Anuncios) wrote:
> Hi
> 
> I'm getting an error when I import an XML file with DIH.
> 
> In this file my <id> is a URL which looks like this:
> 
> http://www.example.com/?cp=30_s&st=a&c=655
> 
> Apparently the issue is with the "=" character?
> 
> Is there any workaround?
> 
> Error trace:
>   rows processed:0 Processing Document # 849
>          at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
>          at org.apache.solr.handler.dataimport.XPathEntityProcessor.initQuery(XPathEntityProcessor.java:328)
>          at org.apache.solr.handler.dataimport.XPathEntityProcessor.fetchNextRow(XPathEntityProcessor.java:224)
>          at org.apache.solr.handler.dataimport.XPathEntityProcessor.nextRow(XPathEntityProcessor.java:204)
>          at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:238)
>          at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:591)
>          at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:617)
>          at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:267)
>          at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:186)
>          at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:353)
>          at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:411)
>          at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:392)
> Caused by: java.lang.RuntimeException: com.ctc.wstx.exc.WstxUnexpectedCharException: Unexpected character '=' (code 61); expected a semi-colon after the reference for entity 'st'
>   at [row,col {unknown-source}]: [2,199]
>          at org.apache.solr.handler.dataimport.XPathRecordReader.streamRecords(XPathRecordReader.java:187)
>          at org.apache.solr.handler.dataimport.XPathEntityProcessor.initQuery(XPathEntityProcessor.java:318)
>          ... 10 more
> Caused by: com.ctc.wstx.exc.WstxUnexpectedCharException: Unexpected character '=' (code 61); expected a semi-colon after the reference for entity 'st'
>   at [row,col {unknown-source}]: [2,199]
>          at com.ctc.wstx.sr.StreamScanner.throwUnexpectedChar(StreamScanner.java:648)
>          at com.ctc.wstx.sr.StreamScanner.parseEntityName(StreamScanner.java:1994)
>          at com.ctc.wstx.sr.StreamScanner.fullyResolveEntity(StreamScanner.java:1496)
>          at com.ctc.wstx.sr.BasicStreamReader.skipTokenText(BasicStreamReader.java:3568)
>          at com.ctc.wstx.sr.BasicStreamReader.skipToken(BasicStreamReader.java:3342)
>          at com.ctc.wstx.sr.BasicStreamReader.nextFromTree(BasicStreamReader.java:2622)
>          at com.ctc.wstx.sr.BasicStreamReader.next(BasicStreamReader.java:1019)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.handleStartElement(XPathRecordReader.java:376)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.parse(XPathRecordReader.java:310)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.handleStartElement(XPathRecordReader.java:346)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.parse(XPathRecordReader.java:310)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.handleStartElement(XPathRecordReader.java:346)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.parse(XPathRecordReader.java:310)
>          at org.apache.solr.handler.dataimport.XPathRecordReader$Node.access$200(XPathRecordReader.java:202)
>          at org.apache.solr.handler.dataimport.XPathRecordReader.streamRecords(XPathRecordReader.java:184)
>          ... 11 more
> 
> Apr 13, 2011 4:00:25 PM org.apache.solr.update.DirectUpdateHandler2 rollback
> INFO: start rollback
> Apr 13, 2011 4:00:25 PM org.apache.solr.update.DirectUpdateHandler2 rollback
> INFO: end_rollback

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350
