On Tue, Jan 12, 2010 at 1:09 PM, Smiley, David W. <dsmi...@mitre.org> wrote: > The beauty of StreamingUpdateSolrServer is that you don't have to worry about > batch sizes; it streams them all. Just keep calling add() with one document > and it'll get enqueued. You can pass a collection but there's no performance > benefit.
Right - and the problem with building your own collection and passing it is that it's not being streamed (if it takes any time to build those docs - like reading from a DB - then that thread may be idle for some amount of time). If you separate and make document production asynchronous from document sending, then you've just re-invented StreamingUpdateSolrServer. I'd really recommend just starting with StreamingUpdateSolrServer for any amount of indexing. -Yonik http://www.lucidimagination.com