Hi great community, I have a SolrCloud with the following configuration:
- 2 nodes (r3.2xlarge, 61 GB RAM)
- 4 shards
- The producer can produce 13,000+ docs per second
- The schema contains 300+ fields, and the document size is about 3 KB
- Using SolrJ and SolrCloudClient; each batch sent to Solr contains 500 docs

When I start my bulk indexer program, CPU utilization is 100% on each server, but the indexing rate is only about 1,500 docs per second.

I know that some Solr benchmarks have reached 70,000+ docs per second.

The question: what is the best way to determine the bottleneck in the Solr indexing rate?

Thanks,
Mahmoud
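
One possible first step for a question like this is to split the client-side wall-clock time into "building a batch" and "sending a batch", to see whether the time goes to the producer or to the round trip to Solr. The following is only a minimal sketch under stated assumptions: `IndexingTimer`, `producer`, and `sender` are hypothetical names, and `sender` is a stand-in for whatever SolrJ call the indexer really makes (here it just sleeps). It does not measure anything inside Solr itself.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;
import java.util.function.Supplier;

/** Splits the time of a bulk-indexing loop into "build batch" and "send batch". */
final class IndexingTimer {
    long buildNanos; // time spent creating documents on the client
    long sendNanos;  // time spent inside the send call (network + server work)
    long docsSent;

    /**
     * Runs the given number of batches. `producer` builds one document and
     * `sender` ships one batch; both are hypothetical stand-ins for the real
     * producer and the real SolrJ call.
     */
    <T> void run(int batches, int batchSize, Supplier<T> producer, Consumer<List<T>> sender) {
        for (int b = 0; b < batches; b++) {
            long t0 = System.nanoTime();
            List<T> batch = new ArrayList<>(batchSize);
            for (int i = 0; i < batchSize; i++) {
                batch.add(producer.get());
            }
            long t1 = System.nanoTime();
            sender.accept(batch);
            long t2 = System.nanoTime();

            buildNanos += t1 - t0;
            sendNanos += t2 - t1;
            docsSent += batch.size();
        }
    }

    /** Overall throughput seen by this single client thread. */
    double docsPerSecond() {
        long total = buildNanos + sendNanos;
        return total == 0 ? 0.0 : docsSent * 1e9 / total;
    }

    /** Fraction of the measured time spent waiting on the send call. */
    double sendFraction() {
        long total = buildNanos + sendNanos;
        return total == 0 ? 0.0 : (double) sendNanos / total;
    }

    public static void main(String[] args) {
        IndexingTimer timer = new IndexingTimer();
        // Assumption: a 5 ms sleep simulates the round trip for one 500-doc batch.
        timer.run(20, 500, () -> "doc", batch -> {
            try {
                Thread.sleep(5);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        System.out.printf("docs/s=%.0f, send fraction=%.2f%n",
                timer.docsPerSecond(), timer.sendFraction());
    }
}
```

If the send fraction is close to 1, the client is mostly waiting on Solr, which would be consistent with the 100% server CPU described above. If it is low, the producer or document construction is the more likely limit.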