Re: Configuration and specs to index a 1 terabyte (TB) repository

Toke Eskildsen Wed, 30 Oct 2013 07:43:41 -0700

On Wed, 2013-10-30 at 14:24 +0100, Shawn Heisey wrote:
> On 10/30/2013 4:00 AM, Toke Eskildsen wrote:
> > Why would TRIM have any influence on whether or not a drive failure
> > also means server failure?
> 
> I left out a step in my description.
> 
> Lack of TRIM support in RAID means that I would avoid RAID with SSD.  No
> RAID means that when the SSD fails, that Solr is out of commission until
> its SSD can be replaced.


That makes sense, thanks.

> I don't know how Solr would behave if you put each core on its own SSD
> and one of them fails.  Hopefully it's smart enough to keep going with
> the cores that have working filesystems.

I don't know either. Seems like it would be a useful thing to test.
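
One way such a test could be scripted is sketched below: pull one SSD, then ask each core whether it still answers. This is only a sketch under assumptions. The base URL and core names are hypothetical, and it assumes each core has the standard /admin/ping handler enabled and returns "status": "OK" when queried with wt=json.

```python
import json
from urllib.request import urlopen


def ping_url(base_url, core):
    """Build the ping URL for one core (assumes the /admin/ping handler is enabled)."""
    return f"{base_url.rstrip('/')}/{core}/admin/ping?wt=json"


def http_fetch(url, timeout=5):
    """Fetch a URL and return the response body as text."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def check_cores(base_url, cores, fetch=http_fetch):
    """Return {core_name: True/False} depending on whether each core answers its ping."""
    results = {}
    for core in cores:
        try:
            body = json.loads(fetch(ping_url(base_url, core)))
            results[core] = body.get("status") == "OK"
        except (OSError, ValueError):
            # Network/HTTP errors are OSError subclasses; bad JSON is a ValueError.
            results[core] = False
    return results


if __name__ == "__main__":
    # Hypothetical host and core names, one core per SSD.
    print(check_cores("http://localhost:8983/solr", ["core1", "core2", "core3"]))
```

If the cores on the healthy drives keep reporting True after one drive is removed, that would suggest Solr carries on with the working filesystems. A ping alone does not prove that searches still work, so a real query per core would be a sensible follow-up.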

We did some comparisons on 9 shards of 420GB each (against a SAN), where
we tested SolrCloud with 9 independent Solr instances vs. a single
instance with multiple cores. The overhead of independent instances did
not seem severe for that shard size, and a setup with independent
instances should be resilient against single drive failure.

As we're looking at a cumulative heap requirement of 100GB+ due to
grouping and faceting, it might be preferable to run with independent
Solrs anyway to minimize garbage collection pauses. I do not know if
that logic extends in general to large Solr installations.
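
As rough arithmetic for the heap argument, here is a small sketch. It assumes the cumulative heap splits evenly across instances and ignores the fixed per-JVM overhead, so the numbers are only indicative. The 100GB and 9-shard figures are from our setup; the function name is made up for illustration.

```python
def per_instance_heap_gb(total_heap_gb, instances):
    """Heap per JVM if the cumulative requirement is split evenly (an assumption)."""
    if instances < 1:
        raise ValueError("need at least one instance")
    return total_heap_gb / instances


if __name__ == "__main__":
    total_heap_gb = 100  # cumulative requirement, lower bound of "100GB+"
    for instances in (1, 9):
        heap = per_instance_heap_gb(total_heap_gb, instances)
        print(f"{instances} instance(s): about {heap:.1f} GB heap each")
```

With 9 instances each JVM would need roughly 11GB instead of a single 100GB heap, and a smaller heap generally means shorter stop-the-world collections.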

Regards,
Toke Eskildsen, State and University Library, Denmark
