Aswath Srinivasan (TMS) <aswath.sriniva...@toyota.com> wrote:
> *         Totally about 2.5 million documents to  be indexed
> *         Documents average size is 512 KB - pdfs and htmls

> This being said I was thinking I would take the Solr to production with,
> *         2 shards, 1 Leader & 3 Replicas

> Do you all think this set up will work? Will this server me 150 QPS?

It certainly helps that you are batch updating. What is missing in this 
estimation is how large the documents are when indexed, as I guess the ½MB 
average is for the raw files? If they are your everyday short PDFs with images, 
meaning not a lot of text, handling 2M+ of them is easy. If they are all 
full-length books, it is another matter.

Your document count is relatively low and if your index data end up being 
not-too-big (let's say 100GB), then you ought to consider having just a single 
shard with 4 replicas: There is a non-trivial overhead going from 1 shard to 
more than one, especially if you are doing faceting.

- Toke Eskildsen

Reply via email to