My apologies. We're running 2 different sets of SOLR instances, the version with the replication problems is 6.2.0 and NOT 4.8.0.
After reading your questions, I double checked the config and we're set to autoCommit after a LOOOOONG time. <autoCommit> <maxTime>36000000</maxTime> <openSearcher>true</openSearcher> </autoCommit> Also using softCommit as well: <autoSoftCommit> <maxTime>60000</maxTime> </autoSoftCommit> After looking at our other servers, I changed the autoCommit maxTime down to 180000 to match. Hoping this might be our issue ;-) I'll report back in a bit though! Thanks for the reminder about commits. We may have forgotten to change these values when these new servers got setup. Scott -----Original Message----- From: Andrea Gazzarini [mailto:gxs...@gmail.com] Sent: Friday, March 10, 2017 3:12 AM To: solr-user@lucene.apache.org Subject: Re: SOLR 4.8.0 Master/Slave Replication Issues Hi Scott, that could depend on a lot of things. Some questions: * What is your commit policy? Explicit / auto / soft / hard ... * "Other days things are off by a small amount between master and slave"...what do you mean exactly? What is the behaviour you see in terms of index versions between nodes? * Any error / message in the log file? Honestly I never used Solr 4.8.0 (this specific version) but I have a lot of customers with other 4.x (mostly 4.10.4) master / slave configuration and they don't have such problems. Best, Andrea On 09/03/17 22:26, Pouliot, Scott wrote: > So we've been having oddball replication issues between master and slave (Not > SOLRCloud) for a few weeks now. Some days...everything lines up just fine. > Other days things are off by a small amount between master and slave. > > Here is the config on the MASTER server replication config: > > <!-- ReplicationHandler --> > <requestHandler name="/replication" class="solr.ReplicationHandler" > > <lst name="master"> > <!--Replicate on 'startup' and 'commit'. 'optimize' is also a valid > value for replicateAfter. --> > <str name="replicateAfter">startup</str> > <str name="replicateAfter">commit</str> > > <!--The default value of reservation is 10 secs.See the documentation > below . Normally , you should not need to specify this --> > <str name="commitReserveDuration">00:00:10</str> > </lst> > <!-- keep only 1 backup. Using this parameter precludes using the > "numberToKeep" request parameter. (Solr3.6 / Solr4.0)--> > <!-- (For this to work in conjunction with "backupAfter" with Solr > 3.6.0, see bug fix https://issues.apache.org/jira/browse/SOLR-3361 )--> > <str name="maxNumberOfBackups">1</str> > </requestHandler> > > Here is the SLAVE server replication config: > > <requestHandler name="/replication" class="solr.ReplicationHandler" > > <lst name="slave"> > > <!--fully qualified url to the master core. It is possible to pass on > this as a request param for the fetchindex command--> > <str > name="masterUrl">http://server01:8080/solr/${solr.core.name}</str> > > <!--Interval in which the slave should poll master .Format is HH:mm:ss > . If this is absent slave does not poll automatically. > But a fetchindex can be triggered from the admin or the http API --> > <str name="pollInterval">00:01:00</str> > </lst> > </requestHandler> > > It seems very sporadic at best. Sometimes I can manually click "Replicate > Now" from the replication page in the SOLR UI and it will work, sometimes it > does nothing. Sometimes I can go to the master server, and if the "Optimize" > button is available and I click it....it will replicate as soon as the > Optimization is done. > > Is this a bug in SOLR? Could we possibly have some settings out of whack > here? I've been digging around online for a bit and not finding much info > here. > > This is on an older version of SOLR though, so wondering if maybe it used to > be a bug? > > Thanks! > > Scott