Re: replication and HDFS

2015-08-31 Thread Erick Erickson
Yes, No, Maybe. bq: Specifically the performance we want to increase is time to facet data, time to cluster data and search time. Well, that about covers everything ;) You cannot talk about this without also talking about cache warming. Given your setup, I'm guessing you have very few searches on

Re: replication and HDFS

2015-08-31 Thread Joseph Obernberger
Thank you Erick. What about cache size? If we add replicas to our cluster and each replica has n GBytes of RAM allocated for HDFS caching, would that help performance? Specifically the performance we want to increase is time to facet data, time to cluster data and search time. While we index

Re: replication and HDFS

2015-08-20 Thread Erick Erickson
Yes. Maybe. It Depends (tm). Details matter (tm). If you're firing just a few QPS at the system, then improved throughput by adding replicas is unlikely. OTOH, if you're firing lots of simultaneous queries at Solr and are pegging the processors, then adding replication will increase aggregate QPS

replication and HDFS

2015-08-20 Thread Joseph Obernberger
Hi - we currently have a multi-shard SolrCloud setup, without replication, running on top of HDFS. Does it make sense to use replication when using HDFS? Should we expect to see a performance increase in searches? Thank you! -Joe