Lucene 5 has added a lot of various CRCs to catch index corruption situations. I don't know if it is 'perfect', but there was certainly a lot of work.
Regards, Alex. ---- Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter: http://www.solr-start.com/ On 12 March 2015 at 18:39, Markus Jelsma <markus.jel...@openindex.io> wrote: > Thanks for sharing Toke! > > Reliability should not be a problem for a Solr cloud environment. A corrupted > index cannot be loaded due to exceptions so the core should not enter an > active state. However, what would happen if parts of the data become > corrupted but can still be processed by the codec? I don't even know if the > data has a CRC check to guard against such madness? > > Markus > > -----Original message----- >> From:Toke Eskildsen <t...@statsbiblioteket.dk> >> Sent: Thursday 12th March 2015 21:33 >> To: solr-user <solr-user@lucene.apache.org> >> Subject: SSD endurance >> >> For those who have not yet taken the leap to SSD goodness because they are >> afraid of flash wear, the burnout test from The Tech Report seems worth a >> read. The short story is that they wrote data to the drives until they wore >> out. All tested drives survived considerably longer than guaranteed, but 4/6 >> failed catastrophically when they did die. >> >> http://techreport.com/review/27909/the-ssd-endurance-experiment-theyre-all-dead >> >> I am disappointed about the catastrophic failures. One of the promises of >> SSDs was graceful end of life by switching to read-only mode. Some of them >> did give warnings before the end, but I wonder how those are communicated in >> a server environment? >> >> >> Regarding Lucene/Solr, the write pattern when updating an index is benign to >> SSDs: Updates are relatively bulky, rather than the evil >> constantly-flip-random-single-bits-and-flush pattern of databases. With >> segments being immutable, the bird's eye view is that Lucene creates and >> deletes large files, which makes it possible for the SSD's wear-leveler to >> select the least-used flash sectors for new writes: The write pattern over >> time is not too far from the one that The Tech Report tested with. >> >> - Toke Eskildsen >> Whose trusty old 160GB Intel X25-M reports an accumulated 36TB of writes. >>