On Tue, Jan 12, 2010 at 1:09 PM, Smiley, David W. <dsmi...@mitre.org> wrote:
> The beauty of StreamingUpdateSolrServer is that you don't have to worry about 
> batch sizes; it streams them all.  Just keep calling add() with one document 
> and it'll get enqueued.  You can pass a collection but there's no performance 
> benefit.

Right - and the problem with building your own collection and passing
it is that it's not being streamed (if it takes any time to build
those docs - like reading from a DB - then that thread may be idle for
some amount of time).  If you separate and make document production
asynchronous from document sending, then you've just re-invented
StreamingUpdateSolrServer.

I'd really recommend just starting with StreamingUpdateSolrServer for
any amount of indexing.

-Yonik
http://www.lucidimagination.com

Reply via email to