Hi, I am doing some testing of dataimport to Solr from XML-documents with many children in the children. To parse the children i some levels down using Xpath goes fine, but the speed is very slow. (~1 minute per document, on a quad Xeon server). When i do the same using the format solr wants it, the parsing time is 0.02 seconds per document.
I have published a quick example here: http://pastebin.com/adhcEvRx My question is: I hope that i have done something wrong in the child-parsing (as you can see, it goes down quite a few levels). Can anybody point me in the right direction so i can speed up the process? I have been looking around for some examples, but nobody gives examples of such deep data indexing. PS: I know there are some bugs in the Xpath naming etc, but it is just a rough example :) -- Best regars Tor Henning Ueland