A quick update on my experiments with update rate:

* 20 docs/sec using one wget call per POST
* 170 docs/sec using single doc POST over a persistent HTTP connection
* 250 docs/sec using 20 doc batches over persistent HTTP
* 250 docs/sec using 100 doc batches over persistent HTTP
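For anyone who wants to try the batched setup, here is a rough Python sketch of it, not the script I actually ran. It groups docs into Solr XML `<add>` messages and posts them over one persistent HTTP connection. The function names, the host, port, and `/solr/update` path, and the dict-per-document input are all assumptions for illustration. The commit and optimize intervals are parameters that default to the 2000 and 10,000 docs used in these runs.

```python
import http.client
from xml.sax.saxutils import escape, quoteattr


def add_message(docs):
    """Build one Solr XML <add> message from a list of {field: value} dicts."""
    parts = ["<add>"]
    for doc in docs:
        parts.append("<doc>")
        for name, value in doc.items():
            # quoteattr() returns the name already wrapped in quotes.
            parts.append("<field name=%s>%s</field>"
                         % (quoteattr(name), escape(str(value))))
        parts.append("</doc>")
    parts.append("</add>")
    return "".join(parts)


def plan_messages(docs, batch_size=20, commit_every=2000, optimize_every=10000):
    """Yield XML bodies in send order: batched adds, plus <commit/> and
    <optimize/> whenever the running doc count hits those intervals."""
    batch, count = [], 0
    for doc in docs:
        batch.append(doc)
        count += 1
        at_commit = count % commit_every == 0
        at_optimize = count % optimize_every == 0
        # Flush early at an interval so the commit covers every doc so far.
        if len(batch) >= batch_size or at_commit or at_optimize:
            yield add_message(batch)
            batch = []
        if at_commit:
            yield "<commit/>"
        if at_optimize:
            yield "<optimize/>"
    if batch:
        yield add_message(batch)
    if count % commit_every != 0:
        # Final commit so the trailing docs become searchable.
        yield "<commit/>"


def post_all(messages, host="localhost", port=8983, path="/solr/update"):
    """POST each message over a single kept-alive connection.
    host, port, and path are assumed values; adjust for your install."""
    conn = http.client.HTTPConnection(host, port)
    try:
        for body in messages:
            conn.request("POST", path, body=body.encode("utf-8"),
                         headers={"Content-Type": "text/xml; charset=utf-8"})
            resp = conn.getresponse()
            resp.read()  # drain the response so the connection can be reused
            if resp.status != 200:
                raise RuntimeError("update failed: HTTP %d" % resp.status)
    finally:
        conn.close()
```

Usage would be something like `post_all(plan_messages(my_docs, batch_size=100))`. Keeping the planning separate from the sending means the batching and commit cadence can be checked without a running server.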
The last three used a commit every 2000 docs (not batches) and an optimize every 10,000 docs. Each submitted document is between 200 and 700 bytes, pretty small. I didn't try parallel connections, since this speed is just fine. This is using the default settings for merge factor, max buffered docs, and so on.

wunder
--
Walter Underwood
Search Guru, Netflix