A few others have apparently posted about this too, and SOLR-4413 is the root problem. Basically, what I am seeing is that if your index directory is not index/ but rather index.<timestamp> (as set in index.properties), a full new index is downloaded on every replication, because the download code expects your index to be in <solr_data_dir>/index. A quick workaround might be to rename your index directory to just "index" and see if the problem goes away.
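
If you want to check whether a node is in this situation before renaming anything, here is a minimal sketch in Python. It assumes index.properties is a plain Java-style properties file in the data directory with an "index" key naming the active directory; the function names are made up for illustration and are not part of Solr.

```python
from pathlib import Path


def active_index_dir(data_dir):
    """Return the directory name index.properties points to.

    Falls back to "index" when the file or the key is missing.
    Assumption: the active directory is stored under the key "index".
    """
    props = Path(data_dir) / "index.properties"
    if not props.is_file():
        return "index"
    for raw in props.read_text(encoding="latin-1").splitlines():
        line = raw.strip()
        # Skip blank lines and properties-file comments.
        if not line or line.startswith(("#", "!")):
            continue
        key, sep, value = line.partition("=")
        if sep and key.strip() == "index":
            return value.strip() or "index"
    return "index"


def uses_default_index_dir(data_dir):
    """True when the node serves its index from <data_dir>/index."""
    return active_index_dir(data_dir) == "index"
```

Calling uses_default_index_dir() on your Solr data directory returns False when the node is serving from an index.<timestamp> directory, which is the case where I see the full download.
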
To confirm, look at line 728 of SnapPuller.java (in downloadIndexFiles). I am hoping that the patch and a more unified "getIndexDir" can be added to the next release of Solr, as this is a fairly significant bug to me.

Cheers
Amit

On Thu, Feb 21, 2013 at 12:56 AM, Amit Nithian <anith...@gmail.com> wrote:

> So the diff in generation numbers is, I believe, due to the commits that
> Solr does when it has the new index files, but the fact that it's
> downloading a new index each time is baffling, and I just noticed that too
> (hit the replicate button and noticed a full index download). I'm going to
> pop into the source and see what's going on, unless there's a
> known bug filed about this?
>
>
> On Tue, Feb 19, 2013 at 1:48 AM, Raúl Grande Durán <
> raulgrand...@hotmail.com> wrote:
>
>> Hello.
>> We have recently updated our Solr from 3.5 to 4.1 and everything is
>> running perfectly except the replication between nodes. We have a
>> master-repeater-2slaves architecture and we have seen some things that
>> weren't happening before:
>>
>> When a slave (repeater or slaves) starts to replicate, it needs to
>> download the entire index, even when only small changes have been made
>> to the index at the master. This takes a long time since our index is
>> more than 20 GB.
>>
>> After a replication cycle we have different index generations in
>> master, repeater and slaves. For example:
>>
>> Master: gen. 64590
>> Repeater: gen. 64591
>> Both slaves: gen. 64592
>>
>> My replicationHandler configuration is like this:
>>
>> <requestHandler name="/replication" class="solr.ReplicationHandler" >
>>   <lst name="master">
>>     <str name="enable">${enable.master:false}</str>
>>     <str name="replicateAfter">commit</str>
>>     <str name="replicateAfter">startup</str>
>>     <str name="confFiles">schema.xml,stopwords.txt</str>
>>   </lst>
>>   <lst name="slave">
>>     <str name="enable">${enable.slave:false}</str>
>>     <str name="masterUrl">${solr.master.url:"http://localhost/solr"}</str>
>>     <str name="pollInterval">00:03:00</str>
>>   </lst>
>> </requestHandler>
>>
>> Our problems are very similar to those explained here:
>> http://lucene.472066.n3.nabble.com/Problem-with-replication-td2294313.html
>>
>> Any ideas?? Thanks