On 4/15/2013 3:38 AM, Victor Ruiz wrote:
About SolrCloud, I know it doesn't use master-slave replication, but
incremental updates, item by item. That's why I thought it could work for
us, since our bottleneck appear to be the replication cycles. But another
point is, if the indexing occurs in all servers, 1200 updates/min could also
overload the servers? and therefore have a worst performance than with
master-slave replication?

One version (4.1, I think) has a problem that results in the entire index being replicated every time. The I/O required for that makes everything slow down on both master and slave.

There are reports of new master/slave replication problems with 4.2 and 4.2.1, but I'm not entirely clear on whether those are just cosmetic problems with index version reporting or whether some people are having actual real problems.

In 3.x and older, replication was generally the best option for multiple copies of your index, because there was no NRT indexing capability. Updating the index was a resource-intensive process with a high impact on searching, loading a replicated index was better.

Version 4.x adds NRT capabilities, so indexing impacts searches far less than it used to. SolrCloud with NRT features (frequent soft commits, less frequent hard commits) is the recommended configuration path now.

Thanks,
Shawn

Reply via email to