Yes, No, Maybe.
bq: Specifically the performance we want to increase is time to facet
data, time to cluster data and search time.
Well, that about covers everything ;)
You cannot talk about this without also talking about cache warming. Given your
setup, I'm guessing you have very few searches on
Thank you Erick. What about cache size? If we add replicas to our
cluster and each replica has nGBytes of RAM allocated for HDFS caching,
would that help performance? Specifically the performance we want to
increase is time to facet data, time to cluster data and search time.
While we index
Yes. Maybe. It Depends (tm).
Details matter (tm).
If you're firing just a few QPS at the system, then improved
throughput by adding replicas is unlikely. OTOH, if you're firing lots
of simultaneous queries at Solr and are pegging the processors, then
adding replication will increase aggregate QPS
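
To make the distinction above concrete, here is a rough back-of-the-envelope sketch. It is a toy model, not a benchmark and not anything Solr provides: it assumes each replica of a shard can serve some fixed number of queries per second before its CPUs saturate, and that load is spread evenly across replicas. The function name and the numbers are made up for illustration.

```python
def achieved_qps(offered_qps: float, per_replica_qps: float, replicas: int) -> float:
    """Toy estimate of the aggregate QPS a shard can sustain.

    Assumptions (hypothetical, for illustration only):
      - each replica saturates at per_replica_qps,
      - queries are balanced evenly across replicas,
      - adding replicas does not change single-query latency.
    """
    if replicas < 1:
        raise ValueError("need at least one replica")
    capacity = per_replica_qps * replicas
    # You can never serve more than is offered, nor more than capacity.
    return min(offered_qps, capacity)


# Compare a lightly loaded system (5 QPS offered) with a saturated one (200 QPS offered).
for replicas in (1, 2, 3):
    light = achieved_qps(offered_qps=5, per_replica_qps=50, replicas=replicas)
    heavy = achieved_qps(offered_qps=200, per_replica_qps=50, replicas=replicas)
    print(f"replicas={replicas} light={light} heavy={heavy}")
```

Under these assumptions the lightly loaded case stays flat no matter how many replicas you add, while the saturated case grows with each replica until capacity exceeds the offered load. Real numbers depend on caches, HDFS I/O and query mix, so measure on your own data.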
Hi - we currently have a multi-shard SolrCloud setup running on top of
HDFS, without replication. Does it make sense to use replication when
using HDFS? Should we expect to see a performance increase in
searches?
Thank you!
-Joe