[ https://issues.apache.org/jira/browse/GEODE-7760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17035469#comment-17035469 ]

ASF subversion and git services commented on GEODE-7760:
--------------------------------------------------------

Commit dcca237446d9e33a853ecc192eab0c9e7497012e in geode's branch 
refs/heads/develop from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=dcca237 ]

GEODE-7760: fixing LocatorUDPSecurityDUnitTest.testCrashLocatorMultip… (#4683)

* GEODE-7760: fixing LocatorUDPSecurityDUnitTest.testCrashLocatorMultipleTimes

This new test exposed a flaw in auto-reconnect: the quorum-checker
buffers messages from other members, but the new Membership messenger
cannot decrypt them when it starts up because it does not have the old
encryption keys.

The encrypt/decrypt objects from the old Membership are now kept and
provided to the new Messenger during auto-reconnect.
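
A minimal sketch of the idea described above, not the code from this commit.
All class and method names here (ReconnectSketch, Encryptor, Messenger,
processBuffered) are hypothetical, and the XOR "cipher" is only a toy
stand-in for the real encrypt/decrypt objects. It shows why a restarted
messenger can read buffered messages only if it is handed the encryptor that
the old membership used.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical names throughout; these are not Geode's actual classes.
final class ReconnectSketch {

  /** Toy stand-in for an encrypt/decrypt object (XOR, not real cryptography). */
  static final class Encryptor {
    private final byte key;

    Encryptor(byte key) {
      this.key = key;
    }

    /** XOR is symmetric, so the same call both encrypts and decrypts. */
    byte[] apply(byte[] data) {
      byte[] out = new byte[data.length];
      for (int i = 0; i < data.length; i++) {
        out[i] = (byte) (data[i] ^ key);
      }
      return out;
    }
  }

  /** Stand-in for a messenger that must decrypt buffered messages at startup. */
  static final class Messenger {
    private final Encryptor encryptor;

    /** The encryptor is passed in so a reconnect can reuse the old one. */
    Messenger(Encryptor encryptor) {
      this.encryptor = encryptor;
    }

    List<String> processBuffered(List<byte[]> buffered) {
      List<String> decoded = new ArrayList<>();
      for (byte[] message : buffered) {
        decoded.add(new String(encryptor.apply(message), StandardCharsets.UTF_8));
      }
      return decoded;
    }
  }

  public static void main(String[] args) {
    // Encryptor held by the old membership before it was forced out.
    Encryptor old = new Encryptor((byte) 0x5A);

    // Messages buffered during the quorum check were encrypted with the old key.
    List<byte[]> buffered = new ArrayList<>();
    buffered.add(old.apply("join-request".getBytes(StandardCharsets.UTF_8)));

    // The restarted messenger receives the old encryptor instead of a fresh one.
    Messenger restarted = new Messenger(old);
    System.out.println(restarted.processBuffered(buffered));
  }
}
```

If the restarted messenger were built with a freshly keyed Encryptor instead,
the buffered bytes would decode to garbage, which mirrors the failure the
test exposed.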

* now that we have an encryptor we can process buffered messages

* fixing ID comparison for auto-reconnect.

The local identifier's UUID and view-ID change during auto-reconnect,
so GMSEncrypt was unable to locate the local encrypt/decrypt object.
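
The ID problem above can be illustrated with a small sketch. This is an
assumption-laden example, not the comparison GMSEncrypt actually performs:
the MemberId class, its fields, and the choice to match on host and port only
are hypothetical. It shows how a strict comparison stops matching the local
member once its UUID and view ID change, while a relaxed one still does.

```java
import java.util.UUID;

// Hypothetical names throughout; these are not Geode's actual classes.
final class IdMatchSketch {

  /** Simplified member identifier; the field set is an assumption. */
  static final class MemberId {
    final String host;
    final int port;
    final UUID uuid;
    final int viewId;

    MemberId(String host, int port, UUID uuid, int viewId) {
      this.host = host;
      this.port = port;
      this.uuid = uuid;
      this.viewId = viewId;
    }
  }

  /** Strict match: every field must agree, so it fails after a reconnect. */
  static boolean strictMatch(MemberId a, MemberId b) {
    return relaxedMatch(a, b) && a.uuid.equals(b.uuid) && a.viewId == b.viewId;
  }

  /** Relaxed match: ignores the UUID and view ID, which change on reconnect. */
  static boolean relaxedMatch(MemberId a, MemberId b) {
    return a.host.equals(b.host) && a.port == b.port;
  }

  public static void main(String[] args) {
    MemberId before = new MemberId("locator-1", 10334, UUID.randomUUID(), 3);
    // Same process after auto-reconnect: new UUID and new view ID.
    MemberId after = new MemberId("locator-1", 10334, UUID.randomUUID(), 7);

    System.out.println("strict:  " + strictMatch(before, after));
    System.out.println("relaxed: " + relaxedMatch(before, after));
  }
}
```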

* added logging detail to track down CI failure

* more relaxation on id comparison in gmsencrypt


> NPE in reconnecting Locator
> ---------------------------
>
>                 Key: GEODE-7760
>                 URL: https://issues.apache.org/jira/browse/GEODE-7760
>             Project: Geode
>          Issue Type: Bug
>          Components: management, membership
>            Reporter: Bruce J Schuchardt
>            Assignee: Bruce J Schuchardt
>            Priority: Major
>             Fix For: 1.12.0
>
>          Time Spent: 2h
>  Remaining Estimate: 0h
>
> A v1.9 locator was forced out of the cluster and then threw an NPE when it 
> was reconnecting.  Apparently the new DistributedSystem was also kicked out 
> of the cluster during this time.
> {noformat}
> [fatal 2019/12/18 02:14:55.647 UTC <Location services restart thread> tid=0xb1a5] Uncaught exception in thread Thread[Location services restart thread,5,main]
> java.lang.NullPointerException
>         at org.apache.geode.distributed.internal.InternalLocator.startClusterManagementService(InternalLocator.java:690)
>         at org.apache.geode.distributed.internal.InternalLocator.restartWithDS(InternalLocator.java:1124)
>         at org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1062)
>         at org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$1(InternalLocator.java:983)
>         at java.lang.Thread.run(Thread.java:748)
> {noformat}
> This wasn't from a CI run and there are no other artifacts available.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
