I am using SolrCloud

My initial requirements are:

1) There are about 6000 clients
2) The number of documents from each client are about 500000 (average
document size is about 400 bytes)
3 I have to wipe off the index/collection every night and create new

Any thoughts/ideas/suggestions on:

1) How to index such large number of documents i.e. do I use an http client
to send documents or is data import handler right or should I try uploading
CSV files?

2) How many collections should I use?

3) How many shards / replicas per collection should I use?

4) Do I need multiple Solr servers?

Thanks

Reply via email to