On Tue, Oct 6, 2009 at 9:29 PM, Adam Foltzer <acfolt...@gmail.com> wrote:
> Hi all,
>
> I'm trying to set up DataImportHandler to index some XML documents available
> over web services. The XML includes both content and metadata, so for the
> indexable content, I'm trying to just index everything under the content
> tag:
>
> <entity dataSource="kbws" name="kbxml" pk="title"
>         url="resturl" processor="XPathEntityProcessor"
>         forEach="/document" transformer="HTMLStripTransformer"
>         flatten="true">
>   <field column="content" name="content" xpath="/document/kbml/body"
>          flatten="true" stripHTML="true" />
>   <field column="title" name="title" xpath="/document/kbml/kbq" />
> </entity>
>
> The result of this is that the title field gets populated and indexed (there
> are no child nodes of /document/kbml/kbq), but content does not get indexed
> at all. Since /document/kbml/body has many children, I expected that
> flatten="true" would store all of the body text in the field. Instead, it
> stores nothing at all. I've tried this with many combinations of
> transformers and flatten options, and the result is the same each time.
>

Which Solr version are you using? The flatten attribute was introduced after 1.3 was released.

--
Regards,
Shalin Shekhar Mangar.