You're right, you can't store an XML document directly in Solr.
You have to pull it apart and index it such that you can get whatever
information back you need.

How you flatten data depends entirely upon your needs. The high-level
idea is that you want to create fields such that text searches work. The
moment you start thinking about "how can I express a relationship
in the query", back up and try to flatten the data so you can just *search*.

This is vague, I know. But so much depends on how you want to use
the data that specifics are hard to give.

You've gotta take off your DB hat and not worry about duplicating
data. De-normalize lots and lots and lots first...

Best
Erick

On Wed, May 18, 2011 at 5:27 PM, Judioo <cont...@judioo.com> wrote:
> Great document. I can see how to import the data direct from the database.
> However it seems as though I need to write xpath's in the config to extract
> the fields that I wish to transform into an solr document.
>
> So it seems that there is no way of storing the document structure in solr
> as is?
>
>
> 2011/5/18 Yury Kats <yuryk...@yahoo.com>
>
>> On 5/18/2011 4:19 PM, Judioo wrote:
>>
>> > Any help is greatly appreciated. Pointers to documentation that address
>> my
>> > issues is even more helpful.
>>
>> I think this would be a good start:
>>
>> http://wiki.apache.org/solr/DataImportHandler#Usage_with_XML.2BAC8-HTTP_Datasource
>>
>

Reply via email to