Aswath Srinivasan (TMS) <aswath.sriniva...@toyota.com> wrote: > * Totally about 2.5 million documents to be indexed > * Documents average size is 512 KB - pdfs and htmls
> This being said I was thinking I would take the Solr to production with, > * 2 shards, 1 Leader & 3 Replicas > Do you all think this set up will work? Will this server me 150 QPS? It certainly helps that you are batch updating. What is missing in this estimation is how large the documents are when indexed, as I guess the ½MB average is for the raw files? If they are your everyday short PDFs with images, meaning not a lot of text, handling 2M+ of them is easy. If they are all full-length books, it is another matter. Your document count is relatively low and if your index data end up being not-too-big (let's say 100GB), then you ought to consider having just a single shard with 4 replicas: There is a non-trivial overhead going from 1 shard to more than one, especially if you are doing faceting. - Toke Eskildsen