Glad it worked out, I like to avoid heroics whenever possible ;)… It can take quite some time for solr to finally and for good give up, waiting 10-15 minutes for something to change seems like an eternity.
What’s happening here is the node attempts to recover but fails for some reason. So it backs off and tries again. And again… before throwing in the towel. The parallel collection is also kind of a last-ditch thing to try, but at least it keeps the old collection around so you can try heroics if the parallel collection doesn’t work ;). Best, Erick > On Feb 14, 2020, at 11:53 AM, tedsolr <tsm...@sciquest.com> wrote: > > Overnight the replicas with a state of "down" changed to "recovery_failed". > Nothing I did. So I brought down both nodes, then started one and waited 5 > min. A leader was born then I started the other node. So luckily no heroics > were needed. > > I'll remember your advice about creating a parallel collection and copying > the data directory. > > > > -- > Sent from: https://lucene.472066.n3.nabble.com/Solr-User-f472068.html