On Apr 27, 2012, at 9:54 AM, ravi wrote:

> Hey Mark, Thanks again for your reply. 
> 
> /*"The way we know that its no longer connected to zookeeper is looking at
> live_nodes - which are ephemeral and will go away if a node goes away"*/
> 
> i am not too sure if this is really the case. As far as i remember, even
> after a node was dead, live_nodes still reported that node as active /but/
> the leader was changed to the one that was /really/ alive. 

Should not be the case.

> 
> I had a look in the Overseer's code and it seems its looping on FIFO queue
> and wait for new state update requests. So if a node was killed, it would
> never be sending a state update request and i guess that's why the state is
> out of sync. 
> 
> If we can set up a wait time for each known node and then declare a node as
> INACTIVE if overseer does not hear from that node within the wait time.
> Something similar to heartbeats in several other systems. 

The /live_nodes stuff does use a heartbeat. That's why we use it as we do in 
combination with the state.

> 
> 
> 
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/SolrCloud-Programmatically-create-multiple-collections-tp3916927p3944327.html
> Sent from the Solr - User mailing list archive at Nabble.com.

- Mark Miller
lucidimagination.com











Reply via email to