Re: Solr xml img parsing exception

2013-11-15 Thread Marcello Lorenzi
Hi Jack, we have analyzed the issue and there were duplicated jar into the tomcat classpath for Tika. After the removal of the dulicated library now the search engine works as expected. Thanks for the support, Marcello On 11/14/2013 05:24 PM, Jack Krupansky wrote: The actual error appears to

Re: Solr xml img parsing exception

2013-11-14 Thread Jack Krupansky
The actual error appears to be: Caused by: org.xml.sax.SAXParseException; lineNumber: 91; columnNumber: 105; The element type "img" must be terminated by the matching end-tag "". So, check the input document at line 91, column 105. There should be an tag there, but SAX is complaining that ther

Re: Solr xml img parsing exception

2013-11-14 Thread Marcello Lorenzi
Hi Erik, but in this case the custom loader receives an HTTP Error 500 by SOLR? Thanks, Marcello On 11/14/2013 04:29 PM, Erik Hatcher wrote: Also there's a custom loader here that is the culprit: com.lsegroup.solr.handler.CwsExtractingDocumentLoader On Nov 14, 2013, at 10:20, Erick Erickson

Re: Solr xml img parsing exception

2013-11-14 Thread Erik Hatcher
Also there's a custom loader here that is the culprit: com.lsegroup.solr.handler.CwsExtractingDocumentLoader On Nov 14, 2013, at 10:20, Erick Erickson wrote: > It looks like bad data. The XML you're sending to Solr looks mal-formed, so > I > suspect this is completely outside of Solr's purview

Re: Solr xml img parsing exception

2013-11-14 Thread Erick Erickson
It looks like bad data. The XML you're sending to Solr looks mal-formed, so I suspect this is completely outside of Solr's purview. Best, Erick On Thu, Nov 14, 2013 at 9:26 AM, Marcello Lorenzi wrote: > Hi, > I have installed a Solr 4.3 instance and we have configured manifoldcf to > pass web c