Hi,

Hmm, the tool is already getting bloated for a one-class, no-deps tool :) I guess a tool with real-life code examples using SolrJ and other libs (such as a robots.txt lib, commons-cli, etc.) would be useful too, but whether that should be an extension of SimplePostTool or a totally new tool written from scratch is something to discuss. Please bring on your ideas for how you plan to extend it. Perhaps you could even simplify the code in the process?

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com
Solr Training - www.solrtraining.com

On 3 Feb 2013, at 17:19, Upayavira <u...@odoko.co.uk> wrote:

> I have a scenario in which I need to post 500,000 documents to Solr as a
> test. I have these documents in XML files already formatted in Solr's
> XML format.
>
> Posting to Solr using post.jar takes 1m55s. With a bit of bash
> jiggery-pokery, I was able to get this down to 1m08s by running four
> concurrent post.jar instances, which strikes me as a significant
> improvement.
>
> I'm considering adding multithreaded capabilities to post.jar, but
> before I go to that effort, I wanted to see if anyone else would
> consider it a useful feature. Given that the SimplePostTool is becoming
> far from simple, I wanted to see whether the feature is likely to be
> accepted before I put in the effort. Also, I would need to consider
> which parts of the tool to add that to. Currently I only want it for
> posting XML docs, but there are also crawling capabilities in it too.
>
> Thoughts?
>
> Upayavira
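
To make the discussion concrete, here is a rough sketch of what "four concurrent post.jar instances" might look like inside a single JVM. It is not taken from SimplePostTool: the class name `ParallelPoster`, the method `postAll` and the `poster` callback are all hypothetical, and the callback stands in for whatever actually sends one file to Solr.

```java
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Consumer;

// Hypothetical helper, not part of SimplePostTool or SolrJ.
class ParallelPoster {

    /**
     * Calls poster once per file on a fixed pool of `threads` workers.
     * Returns the number of files whose poster call finished without
     * throwing. How a single file is posted is left to the caller.
     */
    static int postAll(List<Path> files, int threads, Consumer<Path> poster) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Queue one task per file; the pool runs at most `threads` at once.
            List<Future<?>> pending = new ArrayList<>();
            for (Path file : files) {
                pending.add(pool.submit(() -> poster.accept(file)));
            }
            int ok = 0;
            for (Future<?> f : pending) {
                try {
                    f.get();   // wait for this file's task to finish
                    ok++;
                } catch (ExecutionException e) {
                    // The poster threw for this file; report it and carry on.
                    System.err.println("post failed: " + e.getCause());
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    break;     // stop waiting; remaining tasks are cancelled below
                }
            }
            return ok;
        } finally {
            pool.shutdownNow();
        }
    }
}
```

A caller would pass something like `ParallelPoster.postAll(xmlFiles, 4, file -> postOneFile(file))`, where `postOneFile` is again a placeholder for the existing single-file posting logic. Keeping the per-file step behind a callback is one way to add concurrency for the XML posting case without touching the crawling code.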