On Wed, 2015-04-08 at 14:00 -0700, pras.venkatesh wrote: > 1. 8 nodes, 4 shards(2 nodes per shard) > 2. each node having about 55 GB of Data, in total there is 450 million > documents in the collection. so the document size is not huge,
So ~120M docs/shard. > 3. The schema has 42 fields, it gets reloaded every 15 mins with about > 50,000 documents. Now we have primary Key for the index, so when there are > any duplicates the document gets re-written. > 4. The GC policy is CMS, with heap size min and max = 8 gb and perm size = > 512 mb and RAM on the VM is 24 gb. Do you have a large and active filter cache? Each entry is 30MB, so it does not take many entries to fill a 8GB heap. That would match the description of ever-running GC. - Toke Eskildsen, State and University Library, Denmark