Yes, I erased the tlog in replica 2, and it appears that the first replica's tlog was corrupted by an ungraceful servlet shutdown. Unfortunately there was no log for it, and the ZooKeeper log did not record anything about it either. Is there a place in ZooKeeper where I could check what exactly happened during this election?
Partly related, regarding the transient disk space that needs to be free for replication after a sync failure: why does it replicate the whole index instead of copying just the newly formed segments? That would require much less space than a full copy, wouldn't it? And why not make the 100-document limit for tlog sync configurable?