On 3/16/2014 10:34 AM, adfel70 wrote:
> I have a 12-node solr 4.6.1 cluster. each node has 2 solr procceses, running
> on 8gb heap jvms. each node has total of 64gb memory.
> My current collection (7 shards, 3 replicas) has around 500 million docs. 
> I'm performing bulk indexing into the collection. I set softCommit to 10
> minutes and hardCommit openSearcher=false to 15 minutes.

How much index data does each server have on it?  This would be the sum
total of the index directories of all your cores.

> I recently started seeing the following problems while indexing - every 10
> minutes ( and I assume that this is the 10minutes soft-commit cycles) I get
> the following errors:
> 1. EofExcpetion from jetty in HttpOutput.write send from SolrDispatchFilter
> 2. queries to all cores start getting high latencies (more the 10 seconds)

EofException errors happen when your client disconnects before the
request is complete.  I would strongly recommend that you *NOT*
configure hard timeouts for your client connections, or that you make
them really long, five minutes or so.  For SolrJ, this is the SO_TIMEOUT.

These problems sound like one of two things.  It could be either or both:

1) You don't have enough RAM to cache your index effectively.  With 64GB
of RAM and 16GB heap, you have approximately 48GB of RAM left over for
other software and the OS disk cache.  If the total index size on each
machine is in the neighborhood of 60GB (or larger), this might be a
problem.  If you have software other than Solr running on the machine,
you must subtract it's direct and indirect memory requirements from the
available OS disk cache.

2) Indexing results in a LOT of object creation, most of which exist for
a relatively short time.  This can result in severe problems with
garbage collection pauses.

Both problems listed above (and a few others) are discussed at the wiki
page linked below.  As you will read, there are two major causes of GC
symptoms - a heap that's too small and incorrect (or nonexistent) GC
tuning.  With a very large index like yours, either or both of these GC
symptoms could be happening.

http://wiki.apache.org/solr/SolrPerformanceProblemshttp://wiki.apache.org/solr/SolrPerformanceProblems

Side note: You should only be running one Solr process per machine.
Running multiple processes creates additional memory overhead.  Any hard
limits that you might have run into with a single Solr process can be
overcome with configuration options for Jetty, Solr, or the operating
system.

Thanks,
Shawn

Reply via email to