Looking at http://wiki.apache.org/solr/ExtractingRequestHandler:

Extract Only
"the output includes XML generated by Tika (and is hence further escaped by Solr's XML)"

...is there an option to NOT have the resulting TIKA output escaped?

so &lt;head&gt; would come back as <head/>

If no, what would need to be done to enable this option? Looked into SOLR-1274.patch, but didn't see a parameter for such a thing.

Thanks,

Dan

Reply via email to