Hi Shalin,

Good question; sorry I left the version out of the initial post. I have tried with both a nightly build from earlier this month (Oct 2, I believe) and a build from the trunk as of yesterday afternoon.

Adam

On Tue, Oct 6, 2009 at 5:04 PM, Shalin Shekhar Mangar <shalinman...@gmail.com> wrote:

> On Tue, Oct 6, 2009 at 9:29 PM, Adam Foltzer <acfolt...@gmail.com> wrote:
>
> > Hi all,
> >
> > I'm trying to set up DataImportHandler to index some XML documents
> > available over web services. The XML includes both content and metadata,
> > so for the indexable content, I'm trying to just index everything under
> > the content tag:
> >
> > <entity dataSource="kbws" name="kbxml" pk="title"
> >         url="resturl" processor="XPathEntityProcessor"
> >         forEach="/document" transformer="HTMLStripTransformer"
> >         flatten="true">
> >   <field column="content" name="content" xpath="/document/kbml/body"
> >          flatten="true" stripHTML="true" />
> >   <field column="title" name="title" xpath="/document/kbml/kbq" />
> > </entity>
> >
> > The result of this is that the title field gets populated and indexed
> > (there are no child nodes of /document/kbml/kbq), but content does not
> > get indexed at all. Since /document/kbml/body has many children, I
> > expected that flatten="true" would store all of the body text in the
> > field. Instead, it stores nothing at all. I've tried this with many
> > combinations of transformers and flatten options, and the result is the
> > same each time.
> >
>
> Which Solr version are you using? The flatten attribute was introduced
> after 1.3 released.
>
> --
> Regards,
> Shalin Shekhar Mangar.
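
For reference, the behaviour expected from flatten="true" in the quoted configuration (all text under /document/kbml/body collected into one field value) can be sketched outside Solr. This is only an illustration using Python's standard library, not DataImportHandler's implementation; the sample document and the flatten_text helper are hypothetical, and only the element names come from the thread.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; only the element names
# (document, kbml, kbq, body) are taken from the thread.
SAMPLE = """<document>
  <kbml>
    <kbq>How do I reset my password?</kbq>
    <body>
      <p>Open the <b>account</b> page.</p>
      <p>Click reset.</p>
    </body>
  </kbml>
</document>"""


def flatten_text(xml_string, path):
    """Return all descendant text under `path`, whitespace-normalized.

    `path` is relative to the root element, so "kbml/body" corresponds
    to the absolute XPath /document/kbml/body in the configuration.
    Returns None when no element matches.
    """
    root = ET.fromstring(xml_string)
    node = root.find(path)
    if node is None:
        return None
    # itertext() walks the element and every descendant in document order,
    # yielding text and tail strings; this is the "flattened" content.
    raw = "".join(node.itertext())
    # Collapse runs of whitespace left over from indentation.
    return " ".join(raw.split())


title = flatten_text(SAMPLE, "kbml/kbq")
content = flatten_text(SAMPLE, "kbml/body")
print(title)
print(content)
```

Running the same kind of check against a real response from the web service is a quick way to confirm that the body element actually contains text at the expected path before looking further at the DataImportHandler side.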