On Wed, 2015-04-08 at 14:00 -0700, pras.venkatesh wrote:
> 1. 8 nodes, 4 shards(2 nodes per shard)
> 2. each node having about 55 GB of Data, in total there is 450 million
> documents in the collection. so the document size is not huge, 

So ~120M docs/shard.

> 3. The schema has 42 fields, it gets reloaded every 15 mins with about
> 50,000 documents. Now we have primary Key for the index, so when there are
> any duplicates the document gets re-written.
> 4. The GC policy is CMS, with heap size min and max = 8 gb and perm size =
> 512 mb and RAM on the VM is 24 gb.

Do you have a large and active filter cache? Each entry is 30MB, so it
does not take many entries to fill a 8GB heap. That would match the
description of ever-running GC.

- Toke Eskildsen, State and University Library, Denmark


Reply via email to