Hi!


I am trying to full-text index an XML file. For various reasons, I
cannot use Tika or any other technology to parse the XML file. The
requirement is to full-text index the XML file as is, including the
tags and everything else.

 

So, I created an input index spec like this:

 

<add>

<doc>

<field name="id">1001</field>

<field name="name">NASA Advanced Research Labs</field>

<field name="address">1010 Main Street, Chattanooga, FL 32212</field>

<field name="content"><listing><id>1001</id><name>NASA Advanced
Research Labs</name><address>1010 main street, chattanooga, FL
32212</address></listing></field>

</doc>

</add>

 

When I try to post this to Solr with "java -jar post.jar", I get an
error saying:

 

SimplePostTool: version 1.2

SimplePostTool: WARNING: Make sure your XML documents are encoded in
UTF-8, other encodings are not currently supported

SimplePostTool: POSTing files to http://localhost:8983/solr/update..

SimplePostTool: POSTing file index.doc

SimplePostTool: FATAL: Solr returned an error:
unexpected_XML_tag_doclisting

 

Any idea what I am doing wrong? Does the Solr XML update format support
inner XML content in its field tags? I tried enclosing the inner XML in
<![CDATA[ ... ]]> but that didn't work either.
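
One thing that might be worth trying, sketched here under the assumption
that the update parser rejects any child element inside a field: escape
the inner markup so it arrives as plain character data. This is only a
rough illustration, not a confirmed fix. The helper name build_add_doc
is made up for this example and is not part of Solr.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def build_add_doc(fields):
    """Return an <add><doc>...</doc></add> string with escaped field values.

    `fields` is a list of (name, value) pairs. build_add_doc is a
    hypothetical helper for illustration, not a Solr API.
    """
    lines = ["<add>", "<doc>"]
    for name, value in fields:
        # escape() rewrites &, < and > as entities, so inner tags become
        # character data instead of child elements of <field>.
        safe_name = escape(name, {'"': "&quot;"})
        lines.append('<field name="%s">%s</field>' % (safe_name, escape(value)))
    lines += ["</doc>", "</add>"]
    return "\n".join(lines)


inner = ("<listing><id>1001</id><name>NASA Advanced Research Labs</name>"
         "<address>1010 main street, chattanooga, FL 32212</address></listing>")
xml_doc = build_add_doc([("id", "1001"), ("content", inner)])

# Round-trip check: an XML parser should hand back the original markup
# as the text of the "content" field.
parsed = ET.fromstring(xml_doc)
content = parsed.find("doc/field[@name='content']").text
print(xml_doc)
```

The round-trip at the end only shows that a generic XML parser sees the
field as text. Whether the tags are then searchable would depend on how
the "content" field is analyzed in the schema.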

 

Any help appreciated.

 

Thanks

 

Feroze.
