Right, that will not work at all for direct transmission to Solr. You could write a Java program that parses this and sends it to Solr via SolrJ.
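A minimal sketch of the parsing half of that approach, assuming a well-formed version of the message (the namespace URIs and the Solr field names `id`, `message_type`, `channel` and `role` are made up for illustration). It uses only the JDK's DOM parser and collects name/value pairs; the SolrJ step appears only as a comment, since the client class and URL depend on your Solr version and setup.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

class MessageToSolrFields {

    /** Parses one message and returns field name -> value pairs for one Solr document. */
    static Map<String, String> extractFields(String xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // lets us match "row" regardless of its prefix
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(xml)));

        Map<String, String> fields = new LinkedHashMap<>();

        // Attributes on the root <message> element.
        Element root = doc.getDocumentElement();
        fields.put("id", root.getAttribute("id"));
        fields.put("message_type", root.getAttribute("message-type"));

        // Child elements of the first <row>, in any namespace.
        NodeList rows = doc.getElementsByTagNameNS("*", "row");
        if (rows.getLength() > 0) {
            for (Node child = rows.item(0).getFirstChild();
                 child != null;
                 child = child.getNextSibling()) {
                if (child.getNodeType() == Node.ELEMENT_NODE) {
                    fields.put(child.getLocalName(), child.getTextContent().trim());
                }
            }
        }
        return fields;
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical, well-formed stand-in for the message stored in Oracle.
        String xml =
            "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
            + "<message xmlns=\"urn:example:msg\" xmlns:dsp=\"urn:example:dsp\""
            + " id=\"002\" message-type=\"create\">"
            + "<content><dsp:row>"
            + "<dsp:channel>100</dsp:channel>"
            + "<dsp:role>115</dsp:role>"
            + "</dsp:row></content></message>";

        Map<String, String> fields = extractFields(xml);
        System.out.println(fields);

        // With SolrJ on the classpath, each entry would go into a
        // SolrInputDocument via addField(name, value), and the document
        // would then be passed to the client's add(...) followed by commit().
    }
}
```

The field names have to exist in your schema.xml (or match a dynamic field), and a real program would read the XML column over JDBC and loop over all rows rather than only the first.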
Personally I haven't connected a database to Solr with XPathEntityProcessor in the mix, but I believe I've seen messages go by with this configuration. You might want to search the mail archive...

Best
Erick

On Fri, Apr 13, 2012 at 3:13 PM, srini <softtec...@gmail.com> wrote:
> Erick,
>
> Thanks for your reply. When you say Solr does not index arbitrary XML
> documents, then below is what my XML document looks like; it is sitting
> in Oracle. Could you suggest the best way of indexing it? Which method
> should I follow? Should I use XPathEntityProcessor?
>
> <?xml version="1.0" encoding="UTF-8" ?>
> <message xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
> xmlns="someurl" xmlns:csp="someurl.xsd" xsi:schemaLocation="somelocation
> jar: id="002" message-type="create">
> <content>
> <dsp:row>
> <dsp:channel>100</dsp:channel>
> <dsp:role>115</dsp:role>
> </dsp:row>
>
> </body></content></message>
>
> Thanks in Advance
>
> Erick Erickson wrote
>>
>> Solr does not index arbitrary XML content. There is an XML
>> form of a Solr document that can be sent to Solr, but it is
>> a specific form of XML.
>>
>> An example of the XML you're trying to index and what you mean
>> by "not working" would be helpful.
>>
>> Best
>> Erick
>>
>> On Fri, Apr 13, 2012 at 11:50 AM, srini <softtech88@> wrote:
>>> Not sure why the CDATA part did not get interpreted. This is what the
>>> XML content looks like. I added quotes just to present the exact XML
>>> content.
>>>
>>> "<body></body>"
>>>
>>> --
>>> View this message in context:
>>> http://lucene.472066.n3.nabble.com/Solr-is-not-extracting-the-CDATA-part-of-xml-tp3908317p3908341.html
>>> Sent from the Solr - User mailing list archive at Nabble.com.
>>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Solr-is-not-extracting-the-CDATA-part-of-xml-tp3908317p3908791.html
> Sent from the Solr - User mailing list archive at Nabble.com.