Hi,

I am testing data import into Solr from XML documents with deeply
nested child elements. Parsing the children several levels down with
XPath works, but it is very slow (~1 minute per document on a quad
Xeon server). When I supply the same data in the format Solr expects,
the parsing time is 0.02 seconds per document.

I have published a quick example here:
http://pastebin.com/adhcEvRx

My question is:

I hope that I have simply done something wrong in the child parsing
(as you can see, it goes down quite a few levels). Can anybody point
me in the right direction so I can speed up the process? I have been
looking around for examples, but have not found any that cover
indexing data nested this deeply.

PS: I know there are some bugs in the XPath naming etc., but it is
just a rough example :)

-- 
Best regards
Tor Henning Ueland
