Re: Tips on recursive xml-parsing in dataConfig

Geert-Jan Brits Mon, 07 Jun 2010 07:12:28 -0700

I'm guessing (I'm not familiar with the xml dataimport handler, but I am
pretty familiar with Xpath)
that your problem lies in having absolute xpath-queries, instead of relative
xpath queries to your parent node.


e.g: /DOK/TEKST/KAP is absolute ( the prefixed '/' tells it to be). Try
'KAP' instead.
The same for all xpaths deeper in the tree.

Geert-Jan

2010/6/7 Tor Henning Ueland <tor.henn...@gmail.com>

> Hi,
>
> I am doing some testing of dataimport to Solr from XML-documents with
> many children in the children. To parse the children i some levels
> down using Xpath goes fine, but the speed is very slow. (~1 minute per
> document, on a quad Xeon server). When i do the same using the format
> solr wants it, the parsing time is 0.02 seconds per document.
>
> I have published a quick example here:
> http://pastebin.com/adhcEvRx
>
> My question is:
>
> I hope that i have done something wrong in the child-parsing  (as you
> can see, it goes down quite a few levels). Can anybody point me in the
> right direction so i can speed up the process?  I have been looking
> around for some examples, but nobody gives examples of such deep data
> indexing.
>
> PS: I know there are some bugs in the Xpath naming etc, but it is just
> a rough example :)
>
> --
> Best regars
> Tor Henning Ueland
>

Re: Tips on recursive xml-parsing in dataConfig

Reply via email to