How does the cluster react to the overseer queue entries disappearing?


Jeff Courtade
M: 240.507.6116

On Aug 22, 2017 8:01 AM, "Hendrik Haddorp" <hendrik.hadd...@gmx.net> wrote:

> Hi Jeff,
>
> we ran into that a few times already. We have lots of collections, and when
> nodes get started too fast the overseer queue grows faster than Solr can
> process it. At some point Solr tries to redo things like leader votes and
> adds new tasks to the list, which then gets longer and longer. Once it is
> too long you cannot read out the data anymore, but Solr is still adding
> tasks. In case you already reached that point you have to start ZooKeeper
> and the ZooKeeper client with an increased "jute.maxbuffer" value. I
> usually double it until I can read out the queue again. After that I delete
> all entries in the queue and then start the Solr nodes one by one, like
> every 5 minutes.
>
> regards,
> Hendrik
>
> On 22.08.2017 13:42, Jeff Courtade wrote:
>
>> Hi,
>>
>> I have an issue with what seems to be a blocked-up /overseer/queue.
>>
>> There are 700k + entries.
>>
>> Solr cloud 6.x
>>
>> You cannot addreplica or deletereplica; the commands time out.
>>
>> A full stop and start of Solr and ZooKeeper does not clear it.
>>
>> Is it safe to use the ZooKeeper-supplied zkCli.sh to simply rmr the
>> /overseer/queue?
>>
>>
>> Jeff Courtade
>> M: 240.507.6116
>>
>>
>
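
For reference, the recovery steps Hendrik describes (double "jute.maxbuffer" until the queue is readable, then bring nodes back one at a time) could be sketched roughly as below. This is only an illustration in Python, not something taken from Solr or ZooKeeper: the helper names (find_workable_buffer, jvm_flag, restart_offsets) and the can_read_queue probe are hypothetical, and the probe stands in for whatever check you use to see if the queue can be listed. The starting value 0xfffff is the ZooKeeper default for jute.maxbuffer, and since it is a Java system property it is passed as a -D flag to both the server and the client JVM.

```python
from typing import Callable, List, Tuple

# ZooKeeper's default jute.maxbuffer (0xfffff, just under 1 MiB).
DEFAULT_JUTE_MAXBUFFER = 0xFFFFF


def find_workable_buffer(can_read_queue: Callable[[int], bool],
                         start: int = DEFAULT_JUTE_MAXBUFFER,
                         max_doublings: int = 10) -> int:
    """Double the buffer size until the probe reports the queue is readable.

    can_read_queue is a hypothetical callback: it should restart ZooKeeper
    and the client with the given size and return True if the queue lists.
    """
    size = start
    for _ in range(max_doublings):
        size *= 2
        if can_read_queue(size):
            return size
    raise RuntimeError("queue still unreadable after %d doublings" % max_doublings)


def jvm_flag(size: int) -> str:
    """Format the system property as a JVM command-line flag."""
    return "-Djute.maxbuffer=%d" % size


def restart_offsets(nodes: List[str], gap_seconds: int = 300) -> List[Tuple[str, int]]:
    """Pair each node with a start delay so nodes come up one by one."""
    return [(node, i * gap_seconds) for i, node in enumerate(nodes)]
```

The 300-second default mirrors the "every 5 minutes" spacing mentioned above; the right gap for a given cluster is a judgment call.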
