On Fri, 2012-11-16 at 02:18 +0100, Buttler, David wrote: > Obviously, I could replicate the data so > that I wouldn't lose any documents while I replace my disk, but since I > am already storing the original data in HDFS, (with a 3x replication), > adding additional replication for solr eats into my disk budget a bit > too much.
Nevertheless, limiting damage by excessive sharding is a very peculiar decision. How many bytes are we talking about here? Do you have multi-TB indexes?