I am using SolrCloud My initial requirements are:
1) There are about 6000 clients 2) The number of documents from each client are about 500000 (average document size is about 400 bytes) 3 I have to wipe off the index/collection every night and create new Any thoughts/ideas/suggestions on: 1) How to index such large number of documents i.e. do I use an http client to send documents or is data import handler right or should I try uploading CSV files? 2) How many collections should I use? 3) How many shards / replicas per collection should I use? 4) Do I need multiple Solr servers? Thanks