[jira] [Commented] (GEODE-8442) Exception in server not identified correctly in client

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232747#comment-17232747
 ] 

ASF GitHub Bot commented on GEODE-8442:
---

mkevo opened a new pull request #693:
URL: https://github.com/apache/geode-native/pull/693


   HandleReplay is already commented in other functions as replay is 
unsupported, this change will not translate the real exception to 
UnknownException.
   User will got correct exception and will know how to handle it.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Exception in server not identified correctly in client
> --
>
> Key: GEODE-8442
> URL: https://issues.apache.org/jira/browse/GEODE-8442
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Alberto Bustamante Reyes
>Assignee: Mario Kevo
>Priority: Major
>
> Native client is not identifying correctly an exception returned by the 
> server.
> This is a log from an exception properly identified (I have introduced "" 
> where I think I have to hide sensitive information):
> {code}
> [error 2020/07/22 09:17:10.831337 UTC ] 
> CacheTransactionManager::failover: An exception 
> (org.apache.geode.cache.TransactionDataNodeHasDepartedException: Could not 
> find transaction host for TXId: 
> (default_GeodeDS:1:loner):3:Native_gdbdedfebe1:default_GeodeDS:6022
>   at 
> org.apache.geode.internal.cache.tier.sockets.command.TXFailoverCommand.cmdExecute(TXFailoverCommand.java:126)
>   at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:177)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:848)
>   at 
> org.apache.geode.internal.cache.tier.sockets.OriginalServerConnection.doOneMessage(OriginalServerConnection.java:72)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1212)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>   at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeServerConnectionThreadPool$3(AcceptorImpl.java:676)
>   at 
> org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:119)
>   at java.base/java.lang.Thread.run(Thread.java:834)
> ) happened at remote server.
> data-access: write_element:[TransactionDataNodeHasDepartedException] 
> .get()::apache::geode::client::TransactionDataNodeHasDepartedException:org.apache.geode.cache.TransactionDataNodeHasDepartedException:
>  Could not find transaction host for TXId: 
> (default_GeodeDS:1:loner):3:Native_gdbdedfebe1:default_GeodeDS:6022
> {code}
> But in this case, although the exception received is also a 
> TransactionDataNodeHasDepartedException, it is "translated" to an Unknown 
> exception:
> {code}
> [error 2020/07/22 09:17:10.979506 UTC ] Region::containsKeyOnServer:: An 
> exception (org.apache.geode.cache.TransactionDataNodeHasDepartedException: 
> Transaction data node 192.168.240.13(dms-server-1:1):41000 has departed. 
> To proceed, rollback this transaction and begin a new one., caused by 
> org.apache.geode.internal.cache.ForceReattemptException: PartitionResponse 
> got remote CacheClosedException
>   at 
> org.apache.geode.internal.cache.tx.PartitionedTXRegionStub.containsKey(PartitionedTXRegionStub.java:247)
>   at 
> org.apache.geode.internal.cache.TXStateStub.containsKey(TXStateStub.java:431)
>   at 
> org.apache.geode.internal.cache.TXStateProxyImpl.containsKey(TXStateProxyImpl.java:530)
>   at 
> org.apache.geode.internal.cache.PartitionedRegion.containsKey(PartitionedRegion.java:6402)
>   at 
> org.apache.geode.internal.cache.tier.sockets.command.ContainsKey66.cmdExecute(ContainsKey66.java:138)
>   at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:177)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:848)
>   at 
> org.apache.geode.internal.cache.tier.sockets.OriginalServerConnection.doOneMessage(OriginalServerConnection.java:72)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1212)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExec

[jira] [Updated] (GEODE-8442) Exception in server not identified correctly in client

2020-11-16 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated GEODE-8442:
--
Labels: pull-request-available  (was: )

> Exception in server not identified correctly in client
> --
>
> Key: GEODE-8442
> URL: https://issues.apache.org/jira/browse/GEODE-8442
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Alberto Bustamante Reyes
>Assignee: Mario Kevo
>Priority: Major
>  Labels: pull-request-available
>
> Native client is not identifying correctly an exception returned by the 
> server.
> This is a log from an exception properly identified (I have introduced "" 
> where I think I have to hide sensitive information):
> {code}
> [error 2020/07/22 09:17:10.831337 UTC ] 
> CacheTransactionManager::failover: An exception 
> (org.apache.geode.cache.TransactionDataNodeHasDepartedException: Could not 
> find transaction host for TXId: 
> (default_GeodeDS:1:loner):3:Native_gdbdedfebe1:default_GeodeDS:6022
>   at 
> org.apache.geode.internal.cache.tier.sockets.command.TXFailoverCommand.cmdExecute(TXFailoverCommand.java:126)
>   at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:177)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:848)
>   at 
> org.apache.geode.internal.cache.tier.sockets.OriginalServerConnection.doOneMessage(OriginalServerConnection.java:72)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1212)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>   at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeServerConnectionThreadPool$3(AcceptorImpl.java:676)
>   at 
> org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:119)
>   at java.base/java.lang.Thread.run(Thread.java:834)
> ) happened at remote server.
> data-access: write_element:[TransactionDataNodeHasDepartedException] 
> .get()::apache::geode::client::TransactionDataNodeHasDepartedException:org.apache.geode.cache.TransactionDataNodeHasDepartedException:
>  Could not find transaction host for TXId: 
> (default_GeodeDS:1:loner):3:Native_gdbdedfebe1:default_GeodeDS:6022
> {code}
> But in this case, although the exception received is also a 
> TransactionDataNodeHasDepartedException, it is "translated" to an Unknown 
> exception:
> {code}
> [error 2020/07/22 09:17:10.979506 UTC ] Region::containsKeyOnServer:: An 
> exception (org.apache.geode.cache.TransactionDataNodeHasDepartedException: 
> Transaction data node 192.168.240.13(dms-server-1:1):41000 has departed. 
> To proceed, rollback this transaction and begin a new one., caused by 
> org.apache.geode.internal.cache.ForceReattemptException: PartitionResponse 
> got remote CacheClosedException
>   at 
> org.apache.geode.internal.cache.tx.PartitionedTXRegionStub.containsKey(PartitionedTXRegionStub.java:247)
>   at 
> org.apache.geode.internal.cache.TXStateStub.containsKey(TXStateStub.java:431)
>   at 
> org.apache.geode.internal.cache.TXStateProxyImpl.containsKey(TXStateProxyImpl.java:530)
>   at 
> org.apache.geode.internal.cache.PartitionedRegion.containsKey(PartitionedRegion.java:6402)
>   at 
> org.apache.geode.internal.cache.tier.sockets.command.ContainsKey66.cmdExecute(ContainsKey66.java:138)
>   at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:177)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:848)
>   at 
> org.apache.geode.internal.cache.tier.sockets.OriginalServerConnection.doOneMessage(OriginalServerConnection.java:72)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1212)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>   at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeServerConnectionThreadPool$3(AcceptorImpl.java:676)
>   at 
> org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:119)
>   at java.base/java.lang.Thread.run(Thread.java:834)
> Caused by: org.apache.geode.internal.cache.ForceReattemptException: 
> PartitionResponse got remote CacheClosedException
>   at 
> org.apache.geode.inte

[jira] [Commented] (GEODE-8614) Provide an specific client-side exception for server LowMemoryException

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232794#comment-17232794
 ] 

ASF GitHub Bot commented on GEODE-8614:
---

gaussianrecurrence commented on a change in pull request #688:
URL: https://github.com/apache/geode-native/pull/688#discussion_r524311436



##
File path: cppcache/integration/test/ExceptionTranslationTest.cpp
##
@@ -0,0 +1,125 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *  http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+#include 
+
+#include 
+
+#include 
+#include 
+#include 
+
+#include "framework/Cluster.h"
+
+using apache::geode::client::Cache;
+using apache::geode::client::CacheableKey;
+using apache::geode::client::CacheableString;
+using apache::geode::client::LowMemoryException;
+using apache::geode::client::Pool;
+using apache::geode::client::QueryExecutionLowMemoryException;
+using apache::geode::client::Region;
+using apache::geode::client::RegionShortcut;
+
+using std::chrono::minutes;
+
+namespace {
+Cache createCache() {
+  using apache::geode::client::CacheFactory;
+
+  auto cache = CacheFactory()
+   .set("log-level", "none")
+   .set("statistic-sampling-enabled", "false")
+   .create();
+
+  return cache;
+}
+
+std::shared_ptr createPool(Cluster& cluster, Cache& cache) {
+  auto poolFactory = cache.getPoolManager().createFactory();
+  cluster.applyLocators(poolFactory);
+  return poolFactory.create("default");
+}
+
+std::shared_ptr setupRegion(Cache& cache,
+const std::shared_ptr& pool) {
+  auto region = cache.createRegionFactory(RegionShortcut::PROXY)
+.setPoolName(pool->getName())
+.create("region");
+
+  return region;
+}
+
+void createEntries(const std::shared_ptr& region, std::size_t count,
+   std::size_t size) {
+  std::mt19937 generator{std::random_device{}()};
+  std::string base{"ABCDEFGHIJKLMNOPQRSTUVWXYZ"};
+
+  while (base.length() < size) {
+base += base;
+  }
+  base = base.substr(0, size);
+
+  for (std::size_t i = 0U; i < count;) {
+std::string value_str{base};
+std::shuffle(value_str.begin(), value_str.end(), generator);
+
+auto key = CacheableKey::create(std::to_string(i++));
+auto value = CacheableString::create(value_str);
+region->put(key, value);
+  }
+}
+}  // namespace
+
+TEST(ExceptionTranslationTest, testLowMemoryException) {
+  const auto ENTRY_SIZE = 1024U;
+  const auto ENTRIES = 1U << 20U;
+
+  Cluster cluster{LocatorCount{1}, ServerCount{1},
+  CacheXMLFiles{std::vector{

Review comment:
   I changed the cluster initialization so it improves readability. 
Hopefully that was what you went for with your comment :)





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Provide an specific client-side exception for server LowMemoryException
> ---
>
> Key: GEODE-8614
> URL: https://issues.apache.org/jira/browse/GEODE-8614
> Project: Geode
>  Issue Type: Improvement
>  Components: native client
>Affects Versions: 1.11.0, 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *AS AN* native client contributor
>  *I WANT* to have a client-side exception for LowMemoryException
>  *SO THAT* I can nofity accordingly from the client-side upon server 
> memory-depletion.
> —
> *Additional information*
>  This is the callstack of the LowMemoryException:
> {noformat}
> [error 2020/10/13 09:54:14.401405 UTC 140522117220352] Region::put: An 
> exception (org.apache.geode.cache.LowMemoryException: PartitionedRegion: 
> /part_a cannot process operation on key foo|0 because members 
> [192.168.240.14(dms-server-1:1):41000] are runnin

[jira] [Resolved] (GEODE-8684) Setting a session's maxInactiveInterval does not work when the commit valve is disabled

2020-11-16 Thread Sarah Abbey (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8684?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sarah Abbey resolved GEODE-8684.

Fix Version/s: 1.14.0
   Resolution: Fixed

> Setting a session's maxInactiveInterval does not work when the commit valve 
> is disabled
> ---
>
> Key: GEODE-8684
> URL: https://issues.apache.org/jira/browse/GEODE-8684
> Project: Geode
>  Issue Type: Bug
>  Components: http session
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Jens Deppe
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> I was using the {{CommandServlet}}, in an external test, and noticed that 
> when the commit valve is disabled, attempting to set the session's 
> maxInactiveInterval does not cause the session's maxInactiveInterval to be 
> changed and it remains at the default.
> My setup consisted of 2 Geode servers and a single Tomcat instance (9.0.39). 
> I was checking the sessions with the following gfsh query:
> {code:java}
> query --query="select e.key,e.value.maxInactiveInterval from 
> /gemfire_modules_sessions.entries as e
> {code}
> My {{context.xml}} includes the following:
> {code:xml}
>className="org.apache.geode.modules.session.catalina.Tomcat8DeltaSessionManager"
>   enableLocalCache="false"
>   enableCommitValve="false"
>   regionAttributesId="PARTITION_REDUNDANT"
>   />
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232844#comment-17232844
 ] 

ASF GitHub Bot commented on GEODE-8697:
---

bschuchardt merged pull request #5739:
URL: https://github.com/apache/geode/pull/5739


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:779)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:865)
>  at 
> org.apache.geode.internal.cache.partitio

[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232846#comment-17232846
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/develop from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(Reply

[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232845#comment-17232845
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/develop from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(Reply

[jira] [Resolved] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread Bruce J Schuchardt (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bruce J Schuchardt resolved GEODE-8697.
---
Fix Version/s: 1.14.0
   Resolution: Fixed

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:779)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:865)
>  at 
> org.apache.geode.internal.cache.partitioned.FetchKeysMessage$FetchKeysResponse.waitForKeys(FetchKeysMessage.java:584)
>  at 
> org.apache.geode.internal.cache.PartitionedRegion.getBucketKeys(PartitionedRegion.java:4463)
>  at 
> org.apache.geode.internal.cache.PartitionedRegionDataView.getBucketKeys(PartitionedRegionDataView.java:118)
>  at 
> org.apache.geode.internal.cache.PartitionedRegion$KeysSet$KeysSetIterat

[jira] [Resolved] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread Bruce J Schuchardt (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bruce J Schuchardt resolved GEODE-8670.
---
Fix Version/s: 1.14.0
   Resolution: Fixed

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread Bruce J Schuchardt (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bruce J Schuchardt reassigned GEODE-8670:
-

Assignee: Bruce J Schuchardt

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread John Hutchison (Jira)
John Hutchison created GEODE-8715:
-

 Summary: Redis INFO command 'Keyspace' section should not be 
present if no keys in database
 Key: GEODE-8715
 URL: https://issues.apache.org/jira/browse/GEODE-8715
 Project: Geode
  Issue Type: Improvement
  Components: redis
Reporter: John Hutchison






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232865#comment-17232865
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

jhutchison commented on pull request #5748:
URL: https://github.com/apache/geode/pull/5748#issuecomment-728161074


   > Seems good! Just looks like it needs a Jira ticket. 
[GEODE-8706](https://issues.apache.org/jira/projects/GEODE/issues/GEODE-8706) 
looks like it's for a different issue.
   
   so it does-  weird- not sure how that happened \_?_/ .   thanks- fixed.   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated GEODE-8715:
--
Labels: pull-request-available  (was: )

> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232876#comment-17232876
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

jhutchison opened a new pull request #5753:
URL: https://github.com/apache/geode/pull/5753


   …nt if no keys in database
   
   Thank you for submitting a contribution to Apache Geode.
   
   In order to streamline the review of the contribution we ask you
   to ensure the following steps have been taken:
   
   ### For all changes:
   - [ ] Is there a JIRA ticket associated with this PR? Is it referenced in 
the commit message?
   
   - [ ] Has your PR been rebased against the latest commit within the target 
branch (typically `develop`)?
   
   - [ ] Is your initial contribution a single, squashed commit?
   
   - [ ] Does `gradlew build` run cleanly?
   
   - [ ] Have you written or updated unit tests to verify your changes?
   
   - [ ] If adding new dependencies to the code, are these dependencies 
licensed in a way that is compatible for inclusion under [ASF 
2.0](http://www.apache.org/legal/resolved.html#category-a)?
   
   ### Note:
   Please ensure that once the PR is submitted, check Concourse for build 
issues and
   submit an update to your PR as soon as possible. If you need help, please 
send an
   email to d...@geode.apache.org.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-7710) JMXMBeanReconnectDUnitTest fails intermittently because one locator is missing the LockServiceMXBean

2020-11-16 Thread Geode Integration (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-7710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232879#comment-17232879
 ] 

Geode Integration commented on GEODE-7710:
--

Seen in [DistributedTestOpenJDK8 
#624|https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/624]
 ... see [test 
results|http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0487/test-results/distributedTest/1605328331/]
 or download 
[artifacts|http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0487/test-artifacts/1605328331/distributedtestfiles-OpenJDK8-1.14.0-build.0487.tgz].

> JMXMBeanReconnectDUnitTest fails intermittently because one locator is 
> missing the LockServiceMXBean
> 
>
> Key: GEODE-7710
> URL: https://issues.apache.org/jira/browse/GEODE-7710
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Affects Versions: 1.13.0
>Reporter: Kirk Lund
>Assignee: Kirk Lund
>Priority: Major
>  Labels: flaky
>  Time Spent: 5h
>  Remaining Estimate: 0h
>
> Multiple tests in JMXMBeanReconnectDUnitTest may fail an await due to one of 
> the locators missing the LockServiceMXBean for the cluster config service.
> {noformat}
> but could not find:
>  
> <[GemFire:service=LockService,name=__CLUSTER_CONFIG_LS,type=Member,member=locator1]>
> {noformat}
> These test failures are caused by *GEODE-7739*.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232883#comment-17232883
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

jhutchison commented on pull request #5748:
URL: https://github.com/apache/geode/pull/5748#issuecomment-728173547


   > > Seems good! Just looks like it needs a Jira ticket. 
[GEODE-8706](https://issues.apache.org/jira/projects/GEODE/issues/GEODE-8706) 
looks like it's for a different issue.
   > 
   > so it does- weird- not sure how that happened _?_/ . thanks- fixed.
   
   re-opened pr in https://github.com/apache/geode/pull/5753 with renamed 
branch and pr.  closing this pr.  



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232885#comment-17232885
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

jhutchison commented on pull request #5748:
URL: https://github.com/apache/geode/pull/5748#issuecomment-728174115


   mistakenly assigned unrelated ticket.  Pr closed and reopened in new pr 
https://github.com/apache/geode/pull/5753 with matching branch name



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232884#comment-17232884
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

jhutchison closed pull request #5748:
URL: https://github.com/apache/geode/pull/5748


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232888#comment-17232888
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

jhutchison commented on pull request #5753:
URL: https://github.com/apache/geode/pull/5753#issuecomment-728174609


   previously submitted under https://github.com/apache/geode/pull/5748



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (GEODE-8716) TombstoneDUnitTests fail with a BindException creating a cache server on the default port

2020-11-16 Thread Bruce J Schuchardt (Jira)
Bruce J Schuchardt created GEODE-8716:
-

 Summary: TombstoneDUnitTests fail with a BindException creating a 
cache server on the default port
 Key: GEODE-8716
 URL: https://issues.apache.org/jira/browse/GEODE-8716
 Project: Geode
  Issue Type: Bug
  Components: tests
Reporter: Bruce J Schuchardt


The test should be changed to do a wildcard bind (setPort(0)).

{noformat}
org.apache.geode.internal.cache.versions.TombstoneDUnitTest > 
testGetOldestTombstoneNonReplicate FAILED
org.apache.geode.test.dunit.RMIException: While invoking 
org.apache.geode.internal.cache.versions.TombstoneDUnitTest$$Lambda$46/1195148372.run
 in VM 1 running on Host e45ce8a4541a with 4 VMs
at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
at 
org.apache.geode.internal.cache.versions.TombstoneDUnitTest.testGetOldestTombstoneNonReplicate(TombstoneDUnitTest.java:223)

Caused by:
java.net.BindException: Failed to create server socket on 
172.17.0.20[40404]
at 
org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:75)
at 
org.apache.geode.internal.net.SCClusterSocketCreator.createServerSocket(SCClusterSocketCreator.java:55)
at 
org.apache.geode.internal.net.SocketCreator.createServerSocket(SocketCreator.java:726)
at 
org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.(AcceptorImpl.java:574)
at 
org.apache.geode.internal.cache.tier.sockets.AcceptorBuilder.create(AcceptorBuilder.java:303)
at 
org.apache.geode.internal.cache.CacheServerImpl.createAcceptor(CacheServerImpl.java:427)
at 
org.apache.geode.internal.cache.CacheServerImpl.start(CacheServerImpl.java:384)
at 
org.apache.geode.internal.cache.versions.TombstoneDUnitTest.lambda$testGetOldestTombstoneNonReplicate$bb17a952$1(TombstoneDUnitTest.java:225)

Caused by:
java.net.BindException: Address already in use (Bind failed)
at java.net.PlainSocketImpl.socketBind(Native Method)
at 
java.net.AbstractPlainSocketImpl.bind(AbstractPlainSocketImpl.java:387)
at java.net.ServerSocket.bind(ServerSocket.java:390)
at 
org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:72)
... 7 more

org.apache.geode.internal.cache.versions.TombstoneDUnitTest > 
testGetOldestTombstoneTimeNonReplicate FAILED
org.apache.geode.test.dunit.RMIException: While invoking 
org.apache.geode.internal.cache.versions.TombstoneDUnitTest$$Lambda$52/1794955615.run
 in VM 1 running on Host e45ce8a4541a with 4 VMs
at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
at 
org.apache.geode.internal.cache.versions.TombstoneDUnitTest.testGetOldestTombstoneTimeNonReplicate(TombstoneDUnitTest.java:151)

Caused by:
java.net.BindException: Failed to create server socket on 
172.17.0.20[40404]
at 
org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:75)
at 
org.apache.geode.internal.net.SCClusterSocketCreator.createServerSocket(SCClusterSocketCreator.java:55)
at 
org.apache.geode.internal.net.SocketCreator.createServerSocket(SocketCreator.java:726)
at 
org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.(AcceptorImpl.java:574)
at 
org.apache.geode.internal.cache.tier.sockets.AcceptorBuilder.create(AcceptorBuilder.java:303)
at 
org.apache.geode.internal.cache.CacheServerImpl.createAcceptor(CacheServerImpl.java:427)
at 
org.apache.geode.internal.cache.CacheServerImpl.start(CacheServerImpl.java:384)
at 
org.apache.geode.internal.cache.versions.TombstoneDUnitTest.lambda$testGetOldestTombstoneTimeNonReplicate$bb17a952$1(TombstoneDUnitTest.java:153)

Caused by:
java.net.BindException: Address already in use (Bind failed)
at java.net.PlainSocketImpl.socketBind(Native Method)
at 
java.net.AbstractPlainSocketImpl.bind(AbstractPlainSocketImpl.java:387)
at java.net.ServerSocket.bind(ServerSocket.java:390)
at 
org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:72)
... 7 more
{noformat}

https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/623



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (GEODE-8716) TombstoneDUnitTests fail with a BindException creating a cache server on the default port

2020-11-16 Thread Bruce J Schuchardt (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Bruce J Schuchardt reassigned GEODE-8716:
-

Assignee: Mark Hanson

> TombstoneDUnitTests fail with a BindException creating a cache server on the 
> default port
> -
>
> Key: GEODE-8716
> URL: https://issues.apache.org/jira/browse/GEODE-8716
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Reporter: Bruce J Schuchardt
>Assignee: Mark Hanson
>Priority: Major
>
> The test should be changed to do a wildcard bind (setPort(0)).
> {noformat}
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest > 
> testGetOldestTombstoneNonReplicate FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest$$Lambda$46/1195148372.run
>  in VM 1 running on Host e45ce8a4541a with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.testGetOldestTombstoneNonReplicate(TombstoneDUnitTest.java:223)
> Caused by:
> java.net.BindException: Failed to create server socket on 
> 172.17.0.20[40404]
> at 
> org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:75)
> at 
> org.apache.geode.internal.net.SCClusterSocketCreator.createServerSocket(SCClusterSocketCreator.java:55)
> at 
> org.apache.geode.internal.net.SocketCreator.createServerSocket(SocketCreator.java:726)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.(AcceptorImpl.java:574)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorBuilder.create(AcceptorBuilder.java:303)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.createAcceptor(CacheServerImpl.java:427)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.start(CacheServerImpl.java:384)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.lambda$testGetOldestTombstoneNonReplicate$bb17a952$1(TombstoneDUnitTest.java:225)
> Caused by:
> java.net.BindException: Address already in use (Bind failed)
> at java.net.PlainSocketImpl.socketBind(Native Method)
> at 
> java.net.AbstractPlainSocketImpl.bind(AbstractPlainSocketImpl.java:387)
> at java.net.ServerSocket.bind(ServerSocket.java:390)
> at 
> org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:72)
> ... 7 more
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest > 
> testGetOldestTombstoneTimeNonReplicate FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest$$Lambda$52/1794955615.run
>  in VM 1 running on Host e45ce8a4541a with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.testGetOldestTombstoneTimeNonReplicate(TombstoneDUnitTest.java:151)
> Caused by:
> java.net.BindException: Failed to create server socket on 
> 172.17.0.20[40404]
> at 
> org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:75)
> at 
> org.apache.geode.internal.net.SCClusterSocketCreator.createServerSocket(SCClusterSocketCreator.java:55)
> at 
> org.apache.geode.internal.net.SocketCreator.createServerSocket(SocketCreator.java:726)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.(AcceptorImpl.java:574)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorBuilder.create(AcceptorBuilder.java:303)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.createAcceptor(CacheServerImpl.java:427)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.start(CacheServerImpl.java:384)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.lambda$testGetOldestTombstoneTimeNonReplicate$bb17a952$1(TombstoneDUnitTest.java:153)
> Caused by:
> java.net.BindException: Address already in use (Bind failed)
> at java.net.PlainSocketImpl.socketBind(Native Method)
> at 
> java.net.AbstractPlainSocketImpl.bind(AbstractPlainSocketImpl.java:387)
> at java.net.ServerSocket.bind(Server

[jira] [Commented] (GEODE-8716) TombstoneDUnitTests fail with a BindException creating a cache server on the default port

2020-11-16 Thread Geode Integration (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232891#comment-17232891
 ] 

Geode Integration commented on GEODE-8716:
--

Seen in [DistributedTestOpenJDK8 
#623|https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/623]
 ... see [test 
results|http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0487/test-results/distributedTest/1605319536/]
 or download 
[artifacts|http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0487/test-artifacts/1605319536/distributedtestfiles-OpenJDK8-1.14.0-build.0487.tgz].

> TombstoneDUnitTests fail with a BindException creating a cache server on the 
> default port
> -
>
> Key: GEODE-8716
> URL: https://issues.apache.org/jira/browse/GEODE-8716
> Project: Geode
>  Issue Type: Bug
>  Components: tests
>Reporter: Bruce J Schuchardt
>Assignee: Mark Hanson
>Priority: Major
>
> The test should be changed to do a wildcard bind (setPort(0)).
> {noformat}
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest > 
> testGetOldestTombstoneNonReplicate FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest$$Lambda$46/1195148372.run
>  in VM 1 running on Host e45ce8a4541a with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.testGetOldestTombstoneNonReplicate(TombstoneDUnitTest.java:223)
> Caused by:
> java.net.BindException: Failed to create server socket on 
> 172.17.0.20[40404]
> at 
> org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:75)
> at 
> org.apache.geode.internal.net.SCClusterSocketCreator.createServerSocket(SCClusterSocketCreator.java:55)
> at 
> org.apache.geode.internal.net.SocketCreator.createServerSocket(SocketCreator.java:726)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.(AcceptorImpl.java:574)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorBuilder.create(AcceptorBuilder.java:303)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.createAcceptor(CacheServerImpl.java:427)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.start(CacheServerImpl.java:384)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.lambda$testGetOldestTombstoneNonReplicate$bb17a952$1(TombstoneDUnitTest.java:225)
> Caused by:
> java.net.BindException: Address already in use (Bind failed)
> at java.net.PlainSocketImpl.socketBind(Native Method)
> at 
> java.net.AbstractPlainSocketImpl.bind(AbstractPlainSocketImpl.java:387)
> at java.net.ServerSocket.bind(ServerSocket.java:390)
> at 
> org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:72)
> ... 7 more
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest > 
> testGetOldestTombstoneTimeNonReplicate FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest$$Lambda$52/1794955615.run
>  in VM 1 running on Host e45ce8a4541a with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
> at 
> org.apache.geode.internal.cache.versions.TombstoneDUnitTest.testGetOldestTombstoneTimeNonReplicate(TombstoneDUnitTest.java:151)
> Caused by:
> java.net.BindException: Failed to create server socket on 
> 172.17.0.20[40404]
> at 
> org.apache.geode.distributed.internal.tcpserver.ClusterSocketCreatorImpl.createServerSocket(ClusterSocketCreatorImpl.java:75)
> at 
> org.apache.geode.internal.net.SCClusterSocketCreator.createServerSocket(SCClusterSocketCreator.java:55)
> at 
> org.apache.geode.internal.net.SocketCreator.createServerSocket(SocketCreator.java:726)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.(AcceptorImpl.java:574)
> at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorBuilder.create(AcceptorBuilder.java:303)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.createAcceptor(CacheServerImpl.java:427)
> at 
> org.apache.geode.internal.cache.CacheServerImpl.start(Cache

[jira] [Created] (GEODE-8717) Redis INFO Command should only return specified sections when given parameter

2020-11-16 Thread John Hutchison (Jira)
John Hutchison created GEODE-8717:
-

 Summary: Redis INFO Command should only return specified sections 
when given parameter
 Key: GEODE-8717
 URL: https://issues.apache.org/jira/browse/GEODE-8717
 Project: Geode
  Issue Type: Improvement
  Components: redis
Reporter: John Hutchison






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8573) SerialGatewaySenderOperationsDistributedTest.testRestartSerialGatewaySendersWhilePutting

2020-11-16 Thread Geode Integration (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8573?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232893#comment-17232893
 ] 

Geode Integration commented on GEODE-8573:
--

Seen in [DistributedTestOpenJDK11 
#592|https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK11/builds/592]
 ... see [test 
results|http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0485/test-results/distributedTest/1605308485/]
 or download 
[artifacts|http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0485/test-artifacts/1605308485/distributedtestfiles-OpenJDK11-1.14.0-build.0485.tgz].

> SerialGatewaySenderOperationsDistributedTest.testRestartSerialGatewaySendersWhilePutting
> 
>
> Key: GEODE-8573
> URL: https://issues.apache.org/jira/browse/GEODE-8573
> Project: Geode
>  Issue Type: Bug
>  Components: wan
>Affects Versions: 1.14.0
>Reporter: Mark Hanson
>Assignee: Kirk Lund
>Priority: Major
> Fix For: 1.14.0
>
>
> [https://concourse.apachegeode-ci.info/teams/main/pipelines/apache-develop-main/jobs/DistributedTestOpenJDK8/builds/521]
>  
> {noformat}
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest
>  > testRestartSerialGatewaySendersWhilePutting[1: numDispatchers=3] FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest$$Lambda$356/426344166.run
>  in VM 5 running on Host 538fd3213f62 with 8 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:620)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:447)
> at 
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest.testRestartSerialGatewaySendersWhilePutting(SerialGatewaySenderOperationsDistributedTest.java:407)
> Caused by:
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest
>  that uses org.apache.geode.internal.cache.wan.InternalGatewaySender, 
> org.apache.geode.internal.cache.wan.InternalGatewaySenderint [Sender 
> statistics unprocessed event map size] expected:<[0]> but was:<[3]> within 5 
> minutes.
> at 
> org.awaitility.core.ConditionAwaiter.await(ConditionAwaiter.java:165)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:119)
> at 
> org.awaitility.core.AssertionCondition.await(AssertionCondition.java:31)
> at 
> org.awaitility.core.ConditionFactory.until(ConditionFactory.java:895)
> at 
> org.awaitility.core.ConditionFactory.untilAsserted(ConditionFactory.java:679)
> at 
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest.validateSecondaryQueueSizeStat(SerialGatewaySenderOperationsDistributedTest.java:1216)
> at 
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest.lambda$testRestartSerialGatewaySendersWhilePutting$bb17a952$23(SerialGatewaySenderOperationsDistributedTest.java:407)
> Caused by:
> org.junit.ComparisonFailure: [Sender statistics unprocessed event 
> map size] expected:<[0]> but was:<[3]>
> at 
> sun.reflect.GeneratedConstructorAccessor81.newInstance(Unknown Source)
> at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
> at 
> org.apache.geode.internal.cache.wan.serial.SerialGatewaySenderOperationsDistributedTest.lambda$validateSecondaryQueueSizeStat$8(SerialGatewaySenderOperationsDistributedTest.java:1219)
>  {noformat}
>  
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-= Test Results URI 
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
>  
> [http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0389/test-results/distributedTest/1601774166/]
>  
> =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
> Test report artifacts from this job are available at:
> [http://files.apachegeode-ci.info/builds/apache-develop-main/1.14.0-build.0389/test-artifacts/1601774166/distributedtestfiles-OpenJDK8-1.14.0-build.0389.tgz]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (GEODE-8713) Geode-Native .NET user guide: API reference links point to C++ API

2020-11-16 Thread Dave Barnes (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dave Barnes reassigned GEODE-8713:
--

Assignee: Dave Barnes

> Geode-Native .NET user guide: API reference links point to C++ API
> --
>
> Key: GEODE-8713
> URL: https://issues.apache.org/jira/browse/GEODE-8713
> Project: Geode
>  Issue Type: Bug
>  Components: docs
>Affects Versions: 1.13.0
>Reporter: Dave Barnes
>Assignee: Dave Barnes
>Priority: Major
>
> The "Client Cache XML Reference" section in the .NET user guide contains 
> links to "API Class Reference" that all point to the C++ API ref. These 
> should point to the .NET API ref.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (GEODE-8672) Concurrent transactional destroy with GII could cause an entry to be removed and version information to be lost

2020-11-16 Thread Eric Shu (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Shu resolved GEODE-8672.
-
Fix Version/s: 1.14.0
   Resolution: Fixed

> Concurrent transactional destroy with GII could cause an entry to be removed 
> and version information to be lost
> ---
>
> Key: GEODE-8672
> URL: https://issues.apache.org/jira/browse/GEODE-8672
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Affects Versions: 1.1.0
>Reporter: Eric Shu
>Assignee: Eric Shu
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> In a newly rebalanced bucket, while GII is in progress, a transactional 
> destroy is applied to cache. There is a logic that it should be in token mode 
> and leaves the entry as a Destroyed token, even though the version tag of the 
> entry indicates that it has the correct version.
> However, at end of the GII, there is a 
> cleanUpDestroyedTokensAndMarkGIIComplete method removes all the destroyed 
> entries – this wipes off the entry version tag information and cause the 
> subsequent creates starts fresh with new version tags.
> This could leads to client server data inconsistency as the newly created 
> entries will be ignored by the clients as the newly created entry has lower 
> version number while client has high ones.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8711) Enable SLOWLOG command to support Redis monitoring tools

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232949#comment-17232949
 ] 

ASF GitHub Bot commented on GEODE-8711:
---

sabbey37 commented on a change in pull request #5749:
URL: https://github.com/apache/geode/pull/5749#discussion_r524387822



##
File path: 
geode-redis/src/main/java/org/apache/geode/redis/internal/RedisCommandType.java
##
@@ -265,6 +267,9 @@
   FLUSHDB(new FlushAllExecutor(), UNSUPPORTED, new 
MaximumParameterRequirements(2, ERROR_SYNTAX)),
   INFO(new InfoExecutor(), UNSUPPORTED, new MaximumParameterRequirements(2, 
ERROR_SYNTAX)),
   SHUTDOWN(new ShutDownExecutor(), UNSUPPORTED, new 
MaximumParameterRequirements(2, ERROR_SYNTAX)),
+  SLOWLOG(new SlowlogExecutor(), UNSUPPORTED, new 
MinimumParameterRequirements(2)
+  .and(new MaximumParameterRequirements(3, ERROR_SYNTAX))
+  .and(new SlowlogParameterRequirements())),

Review comment:
   I don't think you need the `new MaximumParameterRequirements(3, 
ERROR_SYNTAX)` here.  In Redis, when more than 3 parameters are provided, the 
following error is returned: `ERR Unknown subcommand or wrong number of 
arguments for '[WHATEVER OPTION YOU TYPED]'. Try SLOWLOG HELP.` (not syntax 
error).

##
File path: 
geode-redis/src/integrationTest/java/org/apache/geode/redis/internal/executor/server/AbstractSlowlogIntegrationTest.java
##
@@ -0,0 +1,117 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more 
contributor license
+ * agreements. See the NOTICE file distributed with this work for additional 
information regarding
+ * copyright ownership. The ASF licenses this file to You under the Apache 
License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance with the 
License. You may obtain a
+ * copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software 
distributed under the License
+ * is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY 
KIND, either express
+ * or implied. See the License for the specific language governing permissions 
and limitations under
+ * the License.
+ */
+
+package org.apache.geode.redis.internal.executor.server;
+
+import static org.assertj.core.api.Assertions.assertThat;
+import static org.assertj.core.api.Assertions.assertThatThrownBy;
+
+import java.util.List;
+
+import org.junit.After;
+import org.junit.Before;
+import org.junit.Test;
+import redis.clients.jedis.Jedis;
+import redis.clients.jedis.Protocol;
+import redis.clients.jedis.util.Slowlog;
+
+import org.apache.geode.test.awaitility.GeodeAwaitility;
+import org.apache.geode.test.dunit.rules.RedisPortSupplier;
+
+public abstract class AbstractSlowlogIntegrationTest implements 
RedisPortSupplier {
+
+  private Jedis jedis;
+  private static final int REDIS_CLIENT_TIMEOUT =
+  Math.toIntExact(GeodeAwaitility.getTimeout().toMillis());
+
+  @Before
+  public void setUp() {
+jedis = new Jedis("localhost", getPort(), REDIS_CLIENT_TIMEOUT);
+  }
+
+  @After
+  public void tearDown() {
+jedis.close();
+  }
+
+  abstract int getExposedPort();
+
+  @Test
+  public void shouldReturnEmptyArray_whenGetSubcommandSpecified() {
+List actualResult = jedis.slowlogGet();
+
+assertThat(actualResult.size()).isEqualTo(0);
+  }
+
+  @Test
+  public void 
shouldReturnEmptyArray_whenGetSubcommandSpecified_withLengthParameter() {
+List actualResult = jedis.slowlogGet(200);
+
+assertThat(actualResult.size()).isEqualTo(0);
+  }
+
+  @Test
+  public void 
shouldReturnEmptyArray_whenGetSubcommandSpecified_withNegativeLengthParameter() 
{
+List actualResult = jedis.slowlogGet(-200);
+
+assertThat(actualResult.size()).isEqualTo(0);
+  }
+
+  @Test
+  public void shouldReturnZero_whenLenSubcommandSpecified() {
+Long length = jedis.slowlogLen();
+
+assertThat(length).isEqualTo(0L);
+  }
+
+  @Test
+  public void shouldReturnOK_whenResetSubcommandSpecified() {
+String response = jedis.slowlogReset();
+
+assertThat(response).isEqualTo("OK");
+  }
+
+  @Test
+  public void shouldThrowException_IfResetGivenAParameter() {
+assertThatThrownBy(
+() -> jedis.sendCommand(
+Protocol.Command.SLOWLOG, "RESET", "Superfluous"))
+.hasMessageContaining("ERR Unknown subcommand or wrong number 
of arguments for");

Review comment:
   It would be nice to put the full error message here and verify that it 
matches exactly (so `hasMessage()` vs. `hasMessageContaining()` (if we end up 
returning the same messages as Redis).

##
File path: 
geode-redis/src/integrationTest/java/org/apache/geode/redis/internal/executor/server/AbstractSlowlogIntegrationTest.java
##
@@ -0,0 +1,117 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more 
contributo

[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232953#comment-17232953
 ] 

ASF GitHub Bot commented on GEODE-8706:
---

boglesby commented on pull request #5750:
URL: https://github.com/apache/geode/pull/5750#issuecomment-728218413


   The lock is controlling access to the connection field. The 
destroyConnection and initializeConnection methods set it so they use the 
writeLock, and the readAcknowledgement and _dispatchBatch methods access it so 
they use the readLock. The getConnection method should also use the readLock, 
but it hasn't up until now with this change.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232960#comment-17232960
 ] 

ASF GitHub Bot commented on GEODE-8706:
---

boglesby commented on a change in pull request #5750:
URL: https://github.com/apache/geode/pull/5750#discussion_r524456655



##
File path: 
geode-wan/src/main/java/org/apache/geode/internal/cache/wan/GatewaySenderEventRemoteDispatcher.java
##
@@ -295,9 +301,16 @@ public Connection getConnection(boolean 
startAckReaderThread) throws GatewaySend
 // OR the connection's ServerLocation doesn't match with the one stored in 
sender
 // THEN initialize the connection
 if (!this.sender.isParallel()) {
-  if (this.connection == null || this.connection.isDestroyed()
-  || this.connection.getServer() == null
-  || 
!this.connection.getServer().equals(this.sender.getServerLocation())) {
+  boolean needToReconnect = false;
+  getConnectionLifeCycleLock().readLock().lock();
+  try {
+needToReconnect = this.connection == null || 
this.connection.isDestroyed()
+|| this.connection.getServer() == null
+|| 
!this.connection.getServer().equals(this.sender.getServerLocation());
+  } finally {
+getConnectionLifeCycleLock().readLock().unlock();
+  }
+  if (needToReconnect) {
 if (logger.isDebugEnabled()) {

Review comment:
   Does this debug statement need to be moved inside the readLock too since 
it calls `this.connection.getServer()`?





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8704) many CI failures in Jetty9CachingClientServerTest

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232975#comment-17232975
 ] 

ASF GitHub Bot commented on GEODE-8704:
---

sabbey37 opened a new pull request #5754:
URL: https://github.com/apache/geode/pull/5754


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> many CI failures in Jetty9CachingClientServerTest
> -
>
> Key: GEODE-8704
> URL: https://issues.apache.org/jira/browse/GEODE-8704
> Project: Geode
>  Issue Type: Bug
>  Components: http session
>Affects Versions: 1.14.0
>Reporter: Kamilla Aslami
>Assignee: Benjamin P Ross
>Priority: Major
>  Labels: pull-request-available
>
> A bunch of failures:
> {code:java}
> org.apache.geode.session.tests.Jetty9CachingClientServerTest > 
> shouldCacheSessionOnClient FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in VM 1 running 
> on Host e4bd15534025 with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:434)
> at 
> org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:102)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.shouldCacheSessionOnClient(Jetty9CachingClientServerTest.java:57)
> Caused by:
> java.lang.IllegalStateException: Session is invalid
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.checkValid(GemfireHttpSession.java:317)
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.setAttribute(GemfireHttpSession.java:301)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$null$0(Jetty9CachingClientServerTest.java:60)
> at java.lang.Iterable.forEach(Iterable.java:75)
> at 
> java.util.Collections$UnmodifiableCollection.forEach(Collections.java:1085)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$shouldCacheSessionOnClient$6b5e8371$1(Jetty9CachingClientServerTest.java:60)org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > shouldNotLeaveNativeSessionInContainer FAILED
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldReplicateCookies FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> 
> java.util.concurrent.TimeoutExceptionorg.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldShareSessionExpirationReset FAILED
> org.junit.ComparisonFailure: 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldExpireInSetTimeframe FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldHavePersistentSessionData FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > failureShouldStillAllowOtherContainersDataAccess FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > invalidationShouldRemoveValueAccessForAllContainers

[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17232977#comment-17232977
 ] 

ASF GitHub Bot commented on GEODE-8706:
---

gesterzhou commented on a change in pull request #5750:
URL: https://github.com/apache/geode/pull/5750#discussion_r524477248



##
File path: 
geode-wan/src/main/java/org/apache/geode/internal/cache/wan/GatewaySenderEventRemoteDispatcher.java
##
@@ -295,9 +301,16 @@ public Connection getConnection(boolean 
startAckReaderThread) throws GatewaySend
 // OR the connection's ServerLocation doesn't match with the one stored in 
sender
 // THEN initialize the connection
 if (!this.sender.isParallel()) {
-  if (this.connection == null || this.connection.isDestroyed()
-  || this.connection.getServer() == null
-  || 
!this.connection.getServer().equals(this.sender.getServerLocation())) {
+  boolean needToReconnect = false;
+  getConnectionLifeCycleLock().readLock().lock();
+  try {
+needToReconnect = this.connection == null || 
this.connection.isDestroyed()
+|| this.connection.getServer() == null
+|| 
!this.connection.getServer().equals(this.sender.getServerLocation());
+  } finally {
+getConnectionLifeCycleLock().readLock().unlock();
+  }
+  if (needToReconnect) {
 if (logger.isDebugEnabled()) {

Review comment:
   No need, you see the original author used:
   (this.connection == null) ? "null" : ...
   
   That means he realized this connection could be null here





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (GEODE-8718) upgrade Shiro to 1.7.0

2020-11-16 Thread Owen Nichols (Jira)
Owen Nichols created GEODE-8718:
---

 Summary: upgrade Shiro to 1.7.0
 Key: GEODE-8718
 URL: https://issues.apache.org/jira/browse/GEODE-8718
 Project: Geode
  Issue Type: Bug
  Components: management
Affects Versions: 1.13.0, 1.12.0, 1.14.0
Reporter: Owen Nichols


Our current Shiro version (1.6.0) is below the recommended version.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8718) upgrade Shiro to 1.7.0

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233022#comment-17233022
 ] 

ASF GitHub Bot commented on GEODE-8718:
---

onichols-pivotal opened a new pull request #5755:
URL: https://github.com/apache/geode/pull/5755


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> upgrade Shiro to 1.7.0
> --
>
> Key: GEODE-8718
> URL: https://issues.apache.org/jira/browse/GEODE-8718
> Project: Geode
>  Issue Type: Bug
>  Components: management
>Affects Versions: 1.12.0, 1.13.0, 1.14.0
>Reporter: Owen Nichols
>Priority: Major
>
> Our current Shiro version (1.6.0) is below the recommended version.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GEODE-8718) upgrade Shiro to 1.7.0

2020-11-16 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated GEODE-8718:
--
Labels: pull-request-available  (was: )

> upgrade Shiro to 1.7.0
> --
>
> Key: GEODE-8718
> URL: https://issues.apache.org/jira/browse/GEODE-8718
> Project: Geode
>  Issue Type: Bug
>  Components: management
>Affects Versions: 1.12.0, 1.13.0, 1.14.0
>Reporter: Owen Nichols
>Priority: Major
>  Labels: pull-request-available
>
> Our current Shiro version (1.6.0) is below the recommended version.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8711) Enable SLOWLOG command to support Redis monitoring tools

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233030#comment-17233030
 ] 

ASF GitHub Bot commented on GEODE-8711:
---

sabbey37 commented on a change in pull request #5749:
URL: https://github.com/apache/geode/pull/5749#discussion_r524442295



##
File path: 
geode-redis/src/main/java/org/apache/geode/redis/internal/executor/server/SlowlogExecutor.java
##
@@ -30,7 +27,21 @@
   @Override
   public RedisResponse executeCommand(Command command,
   ExecutionHandlerContext context) {
-List values = new ArrayList();
-return RedisResponse.array(values);
+String subCommand = command.getStringKey();
+if (subCommand.toLowerCase().equals("get")) {
+  return RedisResponse.emptyArray();
+} else if (subCommand.toLowerCase().equals("len")) {
+  return RedisResponse.integer(0);
+} else if (subCommand.toLowerCase().equals("reset")) {
+  return RedisResponse.ok();
+}
+throw new 
RedisParametersMismatchException(wrongNumberOfArgumentsErrorMessage());
+  }
+
+  public String wrongNumberOfArgumentsErrorMessage() {
+String result;
+result = String
+.format("Unknown subcommand or wrong number of arguments for 'len'. 
Try SLOWLOG HELP.");
+return result;

Review comment:
   It seems kind of weird that we have this hard coded to `len`... right 
now if I type in `slowlog sjls`, I get `ERR Unknown subcommand or wrong number 
of arguments for 'len'. Try SLOWLOG HELP.` instead of `ERR Unknown subcommand 
or wrong number of arguments for 'sjls'. Try SLOWLOG HELP.` It also might be 
good to add this Error to the `RedisConstants` class.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Enable SLOWLOG command to support Redis monitoring tools
> 
>
> Key: GEODE-8711
> URL: https://issues.apache.org/jira/browse/GEODE-8711
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Affects Versions: 1.14.0
>Reporter: Raymond Ingles
>Priority: Major
>  Labels: pull-request-available
>
> The Redis SLOWLOG command tracks slow-executing commands. This will implement 
> a placeholder to prevent errors in tools like Datadog or Redis Insights.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8553) Reduce ThinClientLocatorHelper lock time

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233031#comment-17233031
 ] 

ASF GitHub Bot commented on GEODE-8553:
---

codecov-io edited a comment on pull request #660:
URL: https://github.com/apache/geode-native/pull/660#issuecomment-706753854


   # [Codecov](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=h1) 
Report
   > Merging 
[#660](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=desc) 
(ebb7cb8) into 
[develop](https://codecov.io/gh/apache/geode-native/commit/0d9a99d5e0632de62df17921950cf3f6640efb33?el=desc)
 (0d9a99d) will **increase** coverage by `0.00%`.
   > The diff coverage is `94.65%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/geode-native/pull/660/graphs/tree.svg?width=650&height=150&src=pr&token=plpAqoqGag)](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=tree)
   
   ```diff
   @@Coverage Diff@@
   ##   develop #660+/-   ##
   =
 Coverage74.04%   74.04%
   =
 Files  644  644
 Lines5118951086   -103 
   =
   - Hits 3790337828-75 
   + Misses   1328613258-28 
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=tree) | 
Coverage Δ | |
   |---|---|---|
   | 
[cppcache/src/ThinClientLocatorHelper.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RoaW5DbGllbnRMb2NhdG9ySGVscGVyLmNwcA==)
 | `93.87% <94.44%> (+7.76%)` | :arrow_up: |
   | 
[cppcache/src/ThinClientLocatorHelper.hpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RoaW5DbGllbnRMb2NhdG9ySGVscGVyLmhwcA==)
 | `100.00% <100.00%> (ø)` | |
   | 
[cppcache/src/ThinClientPoolDM.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RoaW5DbGllbnRQb29sRE0uY3Bw)
 | `76.32% <100.00%> (-0.06%)` | :arrow_down: |
   | 
[cppcache/src/ReadWriteLock.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1JlYWRXcml0ZUxvY2suY3Bw)
 | `62.50% <0.00%> (-18.75%)` | :arrow_down: |
   | 
[cppcache/src/PdxTypeRegistry.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1BkeFR5cGVSZWdpc3RyeS5jcHA=)
 | `77.94% <0.00%> (-2.21%)` | :arrow_down: |
   | 
[cppcache/src/ClientMetadataService.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL0NsaWVudE1ldGFkYXRhU2VydmljZS5jcHA=)
 | `62.24% <0.00%> (-0.46%)` | :arrow_down: |
   | 
[cppcache/src/ThinClientRedundancyManager.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RoaW5DbGllbnRSZWR1bmRhbmN5TWFuYWdlci5jcHA=)
 | `76.09% <0.00%> (-0.32%)` | :arrow_down: |
   | 
[cppcache/src/TcrConnection.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RjckNvbm5lY3Rpb24uY3Bw)
 | `72.95% <0.00%> (+0.47%)` | :arrow_up: |
   | 
[cppcache/src/TcrEndpoint.cpp](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RjckVuZHBvaW50LmNwcA==)
 | `55.11% <0.00%> (+0.56%)` | :arrow_up: |
   | ... and [2 
more](https://codecov.io/gh/apache/geode-native/pull/660/diff?src=pr&el=tree-more)
 | |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=footer). 
Last update 
[0d9a99d...ebb7cb8](https://codecov.io/gh/apache/geode-native/pull/660?src=pr&el=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Reduce ThinClientLocatorHelper lock time
> 
>
> Key: GEODE-8553
> URL: https://issues.apache.org/jira/browse/GEODE-8553
> Project: Geode
>  Issue Type: Improvement
>  Components: native client
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>   

[jira] [Created] (GEODE-8719) CI Failure: org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest > givenServerCrashesDuringAPPEND_thenDataIsNotLost

2020-11-16 Thread Sarah Abbey (Jira)
Sarah Abbey created GEODE-8719:
--

 Summary: CI Failure: 
org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest > 
givenServerCrashesDuringAPPEND_thenDataIsNotLost
 Key: GEODE-8719
 URL: https://issues.apache.org/jira/browse/GEODE-8719
 Project: Geode
  Issue Type: Bug
  Components: redis
Reporter: Sarah Abbey


CI failure: https://concourse.apachegeode-ci.info/builds/207449
{code:java}
org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest > 
givenServerCrashesDuringAPPEND_thenDataIsNotLost FAILED
org.apache.geode.test.dunit.RMIException: While invoking 
org.apache.geode.test.dunit.internal.IdentifiableCallable.call in VM 2 running 
on Host e0e2f6af9445 with 4 VMs
at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
at org.apache.geode.test.dunit.VM.invoke(VM.java:460)
at 
org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:268)
at 
org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:261)
at 
org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest.startRedisVM(CrashAndNoRepeatDUnitTest.java:131)
at 
org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest.givenServerCrashesDuringAPPEND_thenDataIsNotLost(CrashAndNoRepeatDUnitTest.java:164)
Caused by:
org.apache.geode.management.ManagementException: Could not start Redis 
Server using bind address: localhost/127.0.0.1 and port: 44579. Please make 
sure nothing else is running on this address/port combination.
Caused by:
java.net.BindException: Address already in use
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GEODE-8719) CI Failure: org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest > givenServerCrashesDuringAPPEND_thenDataIsNotLost

2020-11-16 Thread Sarah Abbey (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sarah Abbey updated GEODE-8719:
---
Priority: Minor  (was: Major)

> CI Failure: 
> org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest > 
> givenServerCrashesDuringAPPEND_thenDataIsNotLost
> -
>
> Key: GEODE-8719
> URL: https://issues.apache.org/jira/browse/GEODE-8719
> Project: Geode
>  Issue Type: Bug
>  Components: redis
>Reporter: Sarah Abbey
>Priority: Minor
>
> CI failure: https://concourse.apachegeode-ci.info/builds/207449
> {code:java}
> org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest > 
> givenServerCrashesDuringAPPEND_thenDataIsNotLost FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableCallable.call in VM 2 
> running on Host e0e2f6af9445 with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:460)
> at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:268)
> at 
> org.apache.geode.test.dunit.rules.ClusterStartupRule.startServerVM(ClusterStartupRule.java:261)
> at 
> org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest.startRedisVM(CrashAndNoRepeatDUnitTest.java:131)
> at 
> org.apache.geode.redis.internal.executor.CrashAndNoRepeatDUnitTest.givenServerCrashesDuringAPPEND_thenDataIsNotLost(CrashAndNoRepeatDUnitTest.java:164)
> Caused by:
> org.apache.geode.management.ManagementException: Could not start 
> Redis Server using bind address: localhost/127.0.0.1 and port: 44579. Please 
> make sure nothing else is running on this address/port combination.   
>  Caused by:
> java.net.BindException: Address already in use
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8537) Memory increases whenever LRU eviction is enabled

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233053#comment-17233053
 ] 

ASF GitHub Bot commented on GEODE-8537:
---

pdxcodemonkey commented on a change in pull request #687:
URL: https://github.com/apache/geode-native/pull/687#discussion_r524527337



##
File path: cppcache/src/LRUQueue.hpp
##
@@ -0,0 +1,100 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *  http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+#pragma once
+
+#ifndef GEODE_LRUQUEUE_H_
+#define GEODE_LRUQUEUE_H_
+
+#include 
+#include 
+
+#include 
+
+#include "util/concurrent/spinlock_mutex.hpp"
+
+namespace apache {
+namespace geode {
+namespace client {
+
+class MapEntryImpl;
+
+/**
+ * This class holds a queue of entries sorted by its use order
+ * @note All accesses to the queue are mutually exclusive
+ */
+class LRUQueue {

Review comment:
   boost::lockfree::queue, maybe?  If this is just a generic thread-safe 
queue implementation, we really shouldn't be writing our own.  This one uses 
our own hand-rolled mutex for locking, as well, which is _also_ a thing that we 
should never have written, that should be removed from the codebase altogether. 
 

##
File path: cppcache/src/EvictionController.hpp
##
@@ -75,29 +73,27 @@ class EvictionController {
 
   void svc(void);
 
-  void updateRegionHeapInfo(int64_t info);
-  void registerRegion(const std::string& name);
-  void deregisterRegion(const std::string& name);
-  void evict(int32_t percentage);
+  void evict(float percentage);
+  void inc_heap_size(int64_t delta);
+  void register_region(const std::string& name);
+  void unregister_region(const std::string& name);

Review comment:
   The style guide (I re-learned after reviewing it) dictates snake case 
for variables and setter/getter functions, and mixed case otherwise.  I prefer 
the mixed case for method names, for consistency if nothing else - can we 
change the instances of this in the PR?

##
File path: cppcache/src/LRUQueue.hpp
##
@@ -0,0 +1,100 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *  http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+#pragma once
+
+#ifndef GEODE_LRUQUEUE_H_
+#define GEODE_LRUQUEUE_H_
+
+#include 
+#include 
+
+#include 
+
+#include "util/concurrent/spinlock_mutex.hpp"
+
+namespace apache {
+namespace geode {
+namespace client {
+
+class MapEntryImpl;
+
+/**
+ * This class holds a queue of entries sorted by its use order
+ * @note All accesses to the queue are mutually exclusive
+ */
+class LRUQueue {

Review comment:
   Let us know if there's some reason here we can't use an off-the-shelf 
queue solution.  Otherwise, let's use one of those.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Memory increases whenever LRU eviction is enabled
> -
>
> Key: GEODE-8537
> URL: https://issues.apache.org/jira/browse/GEODE-8537
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Affects Versions: 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>

[jira] [Commented] (GEODE-8693) C++ native client Function.execute() with onServers does not throw exception if one of the servers goes down while executing the function.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233057#comment-17233057
 ] 

ASF GitHub Bot commented on GEODE-8693:
---

pdxcodemonkey commented on a change in pull request #690:
URL: https://github.com/apache/geode-native/pull/690#discussion_r524546236



##
File path: cppcache/integration/test/FunctionExecutionTest.cpp
##
@@ -251,3 +253,79 @@ TEST(FunctionExecutionTest, 
OnServersWithReplicatedRegionsInPool) {
   resultList[i + ON_SERVERS_TEST_REGION_ENTRIES_SIZE / 2]);
   }
 }
+
+void executeTestFunctionOnLoopAndExpectNotConnectedException(
+std::shared_ptr pool) {
+  // Filter on odd keys
+  auto routingObj = CacheableVector::create();
+  for (int i = 0; i < ON_SERVERS_TEST_REGION_ENTRIES_SIZE; i++) {
+if (i % 2 == 0) {
+  continue;
+}
+routingObj->push_back(CacheableString::create("KEY--" + 
std::to_string(i)));
+  }
+
+  auto execution = FunctionService::onServers(pool);
+
+  ASSERT_THROW(
+  {
+while (true) {
+  // This call must eventually throw the NotConnectedException
+  auto rc =
+  execution.withArgs(routingObj).execute("MultiGetFunctionISlow");
+
+  auto executeFunctionResult = rc->getResult();
+
+  auto resultList = serverResultsToStrings(executeFunctionResult);
+
+  // Executed on 2 servers, we should have two sets of results
+  ASSERT_EQ(resultList.size(), ON_SERVERS_TEST_REGION_ENTRIES_SIZE);
+}
+  },
+  apache::geode::client::NotConnectedException);
+}
+
+TEST(FunctionExecutionTest, OnServersOneServerGoesDown) {
+  Cluster cluster{
+  LocatorCount{1}, ServerCount{2},
+  CacheXMLFiles(
+  {std::string(getFrameworkString(FrameworkVariable::TestCacheXmlDir)) 
+
+   "/func_cacheserver1_pool.xml",
+   std::string(getFrameworkString(FrameworkVariable::TestCacheXmlDir)) 
+
+   "/func_cacheserver2_pool.xml"})};
+
+  cluster.start([&]() {
+cluster.getGfsh()
+.deploy()
+.jar(getFrameworkString(FrameworkVariable::JavaObjectJarPath))
+.execute();
+  });
+
+  auto cache = CacheFactory().set("log-level", "debug").create();
+  auto poolFactory = cache.getPoolManager().createFactory();
+
+  cluster.applyLocators(poolFactory);
+
+  auto pool =
+  
poolFactory.setLoadConditioningInterval(std::chrono::milliseconds::zero())
+  .setIdleTimeout(std::chrono::milliseconds::zero())
+  .create("pool");
+
+  auto region = cache.createRegionFactory(RegionShortcut::PROXY)
+.setPoolName("pool")
+.create("partition_region");
+
+  for (int i = 0; i < ON_SERVERS_TEST_REGION_ENTRIES_SIZE; i++) {
+region->put("KEY--" + std::to_string(i), "VALUE--" + std::to_string(i));
+  }
+
+  auto threadAux = std::make_shared(
+  executeTestFunctionOnLoopAndExpectNotConnectedException, pool);
+
+  // Sleep a bit to allow for some successful responses before the exception
+  sleep(5);

Review comment:
   This doesn't build on Windows:
   ```
   C:\nativeclient\cppcache\integration\test\FunctionExecutionTest.cpp(326): 
error C3861: 'sleep': identifier not found 
   ```
   Looks like we use `std::this_thread::sleep_for` in the rest of the code





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> C++ native client Function.execute() with onServers does not throw exception 
> if one of the servers goes down while executing the function.
> --
>
> Key: GEODE-8693
> URL: https://issues.apache.org/jira/browse/GEODE-8693
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Alberto Gomez
>Assignee: Alberto Gomez
>Priority: Major
>  Labels: pull-request-available
>
> According to the Apache Geode Native C++ API documentation, the 
> FunctionService.onServers() function will throw an Exception if one of the 
> servers goes down while dispatching or executing the function on the server.
> Nevertheless, currently no exception is thrown in that case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8693) C++ native client Function.execute() with onServers does not throw exception if one of the servers goes down while executing the function.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233066#comment-17233066
 ] 

ASF GitHub Bot commented on GEODE-8693:
---

pdxcodemonkey commented on a change in pull request #690:
URL: https://github.com/apache/geode-native/pull/690#discussion_r524547942



##
File path: cppcache/src/ExecutionImpl.cpp
##
@@ -429,33 +429,20 @@ void ExecutionImpl::executeOnAllServers(const 
std::string& func,
   std::shared_ptr exceptionPtr = nullptr;
   GfErrType err = tcrdm->sendRequestToAllServers(
   func.c_str(), getResult, timeout, m_args, m_rc, exceptionPtr);
-  if (exceptionPtr != nullptr && err != GF_NOERR) {
-LOGDEBUG("Execute errorred: %d", err);
-// throw FunctionExecutionException( "Execute: failed to execute function
-// with server." );
-if (err == GF_CACHESERVER_EXCEPTION) {
-  throw FunctionExecutionException(
-  "Execute: failed to execute function with server.");
-} else {
-  throwExceptionIfError("Execute", err);
-}
-  }
-
-  if (err == GF_AUTHENTICATION_FAILED_EXCEPTION ||
-  err == GF_NOT_AUTHORIZED_EXCEPTION ||
-  err == GF_AUTHENTICATION_REQUIRED_EXCEPTION) {
-throwExceptionIfError("Execute", err);
-  }
 
   if (err != GF_NOERR) {
+LOGDEBUG("Execute errorred: %d", err);

Review comment:
   Sorry to be "that guy" about grammar, but errorred isn't really a word.  
Should be "erred" or just avoid it altogether and use "failed".  Thx

##
File path: cppcache/src/ExecutionImpl.cpp
##
@@ -429,33 +429,20 @@ void ExecutionImpl::executeOnAllServers(const 
std::string& func,
   std::shared_ptr exceptionPtr = nullptr;
   GfErrType err = tcrdm->sendRequestToAllServers(
   func.c_str(), getResult, timeout, m_args, m_rc, exceptionPtr);
-  if (exceptionPtr != nullptr && err != GF_NOERR) {
-LOGDEBUG("Execute errorred: %d", err);
-// throw FunctionExecutionException( "Execute: failed to execute function
-// with server." );
-if (err == GF_CACHESERVER_EXCEPTION) {
-  throw FunctionExecutionException(
-  "Execute: failed to execute function with server.");
-} else {
-  throwExceptionIfError("Execute", err);
-}
-  }
-
-  if (err == GF_AUTHENTICATION_FAILED_EXCEPTION ||
-  err == GF_NOT_AUTHORIZED_EXCEPTION ||
-  err == GF_AUTHENTICATION_REQUIRED_EXCEPTION) {
-throwExceptionIfError("Execute", err);
-  }
 
   if (err != GF_NOERR) {
+LOGDEBUG("Execute errorred: %d", err);

Review comment:
   Ugh, and it was even like this to begin with - yikes, sorry.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> C++ native client Function.execute() with onServers does not throw exception 
> if one of the servers goes down while executing the function.
> --
>
> Key: GEODE-8693
> URL: https://issues.apache.org/jira/browse/GEODE-8693
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Alberto Gomez
>Assignee: Alberto Gomez
>Priority: Major
>  Labels: pull-request-available
>
> According to the Apache Geode Native C++ API documentation, the 
> FunctionService.onServers() function will throw an Exception if one of the 
> servers goes down while dispatching or executing the function on the server.
> Nevertheless, currently no exception is thrown in that case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8693) C++ native client Function.execute() with onServers does not throw exception if one of the servers goes down while executing the function.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233069#comment-17233069
 ] 

ASF GitHub Bot commented on GEODE-8693:
---

codecov-io commented on pull request #690:
URL: https://github.com/apache/geode-native/pull/690#issuecomment-728320149


   # [Codecov](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=h1) 
Report
   > Merging 
[#690](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=desc) 
(bf89d76) into 
[develop](https://codecov.io/gh/apache/geode-native/commit/5c7d47d34c2b8a53874ec6f53e66c2290fd0427c?el=desc)
 (5c7d47d) will **decrease** coverage by `0.07%`.
   > The diff coverage is `54.54%`.
   
   [![Impacted file tree 
graph](https://codecov.io/gh/apache/geode-native/pull/690/graphs/tree.svg?width=650&height=150&src=pr&token=plpAqoqGag)](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=tree)
   
   ```diff
   @@ Coverage Diff @@
   ##   develop #690  +/-   ##
   ===
   - Coverage74.12%   74.04%   -0.08% 
   ===
 Files  644  644  
 Lines5118751184   -3 
   ===
   - Hits 3794237899  -43 
   - Misses   1324513285  +40 
   ```
   
   
   | [Impacted 
Files](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=tree) | 
Coverage Δ | |
   |---|---|---|
   | 
[cppcache/src/ExecutionImpl.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL0V4ZWN1dGlvbkltcGwuY3Bw)
 | `69.92% <28.57%> (+1.84%)` | :arrow_up: |
   | 
[cppcache/src/ThinClientPoolDM.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RoaW5DbGllbnRQb29sRE0uY3Bw)
 | `76.71% <100.00%> (+0.23%)` | :arrow_up: |
   | 
[cppcache/src/ReadWriteLock.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1JlYWRXcml0ZUxvY2suY3Bw)
 | `81.25% <0.00%> (-18.75%)` | :arrow_down: |
   | 
[...tion-test/testThinClientRemoteQueryFailoverPdx.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvaW50ZWdyYXRpb24tdGVzdC90ZXN0VGhpbkNsaWVudFJlbW90ZVF1ZXJ5RmFpbG92ZXJQZHguY3Bw)
 | `85.48% <0.00%> (-4.04%)` | :arrow_down: |
   | 
[cppcache/src/TcrEndpoint.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RjckVuZHBvaW50LmNwcA==)
 | `54.27% <0.00%> (-2.39%)` | :arrow_down: |
   | 
[cppcache/src/PdxTypeRegistry.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1BkeFR5cGVSZWdpc3RyeS5jcHA=)
 | `77.94% <0.00%> (-2.21%)` | :arrow_down: |
   | 
[cppcache/src/TXCommitMessage.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RYQ29tbWl0TWVzc2FnZS5jcHA=)
 | `74.50% <0.00%> (-1.97%)` | :arrow_down: |
   | 
[cppcache/src/TcrConnection.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL1RjckNvbm5lY3Rpb24uY3Bw)
 | `72.48% <0.00%> (-1.26%)` | :arrow_down: |
   | 
[cppcache/src/ClientMetadataService.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvc3JjL0NsaWVudE1ldGFkYXRhU2VydmljZS5jcHA=)
 | `61.78% <0.00%> (-1.15%)` | :arrow_down: |
   | 
[...tegration-test/testThinClientRemoteRegionQuery.cpp](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree#diff-Y3BwY2FjaGUvaW50ZWdyYXRpb24tdGVzdC90ZXN0VGhpbkNsaWVudFJlbW90ZVJlZ2lvblF1ZXJ5LmNwcA==)
 | `82.33% <0.00%> (-1.13%)` | :arrow_down: |
   | ... and [8 
more](https://codecov.io/gh/apache/geode-native/pull/690/diff?src=pr&el=tree-more)
 | |
   
   --
   
   [Continue to review full report at 
Codecov](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=continue).
   > **Legend** - [Click here to learn 
more](https://docs.codecov.io/docs/codecov-delta)
   > `Δ = absolute  (impact)`, `ø = not affected`, `? = missing data`
   > Powered by 
[Codecov](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=footer). 
Last update 
[5c7d47d...bf89d76](https://codecov.io/gh/apache/geode-native/pull/690?src=pr&el=lastupdated).
 Read the [comment docs](https://docs.codecov.io/docs/pull-request-comments).
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> C++ native client Function.execute() with onServers does not throw exception 
> if one of the servers goes down while executing the function.
> ---

[jira] [Commented] (GEODE-8715) Redis INFO command 'Keyspace' section should not be present if no keys in database

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233072#comment-17233072
 ] 

ASF GitHub Bot commented on GEODE-8715:
---

sabbey37 merged pull request #5753:
URL: https://github.com/apache/geode/pull/5753


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Redis INFO command 'Keyspace' section should not be present if no keys in 
> database
> --
>
> Key: GEODE-8715
> URL: https://issues.apache.org/jira/browse/GEODE-8715
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: John Hutchison
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233074#comment-17233074
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 1bf18b2e9d3bee8c75ec2acb2331a6c12cc886b6 in geode's branch 
refs/heads/develop from John Hutchison
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=1bf18b2 ]

GEODE-8706: Redis INFO command 'Keyspace' section should not be present if no 
keys in database (#5753)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233076#comment-17233076
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 1bf18b2e9d3bee8c75ec2acb2331a6c12cc886b6 in geode's branch 
refs/heads/develop from John Hutchison
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=1bf18b2 ]

GEODE-8706: Redis INFO command 'Keyspace' section should not be present if no 
keys in database (#5753)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8537) Memory increases whenever LRU eviction is enabled

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8537?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233082#comment-17233082
 ] 

ASF GitHub Bot commented on GEODE-8537:
---

gaussianrecurrence commented on a change in pull request #687:
URL: https://github.com/apache/geode-native/pull/687#discussion_r524585761



##
File path: cppcache/src/LRUQueue.hpp
##
@@ -0,0 +1,100 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *  http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+#pragma once
+
+#ifndef GEODE_LRUQUEUE_H_
+#define GEODE_LRUQUEUE_H_
+
+#include 
+#include 
+
+#include 
+
+#include "util/concurrent/spinlock_mutex.hpp"
+
+namespace apache {
+namespace geode {
+namespace client {
+
+class MapEntryImpl;
+
+/**
+ * This class holds a queue of entries sorted by its use order
+ * @note All accesses to the queue are mutually exclusive
+ */
+class LRUQueue {

Review comment:
   Regarding why it is needed a custom written queue. There are several 
reason why:
   
   1. I wanted to encapsulate LRUProperties iterator management inside a class, 
so any code using the LRUQueue does not have to take care of that.
   2. Usually, queues does not allow to remove entries at a random position, 
which would not make possible to clear deleted entries, which is the whole 
point of this PR.
   3. There is no queue implementation that allows to take an entry and move it 
to its end.
   
   What I am not sure about is wether to use a spinlock, an standard mutex or 
put some effort and try to write a completly lock-free thread-safe LRU queue :S





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Memory increases whenever LRU eviction is enabled
> -
>
> Key: GEODE-8537
> URL: https://issues.apache.org/jira/browse/GEODE-8537
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Affects Versions: 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
> Attachments: massif-8419.png, massif.out.8419
>
>
> *HAVING* configured concurrency-checks-enabled=false in the client-cache.xml 
> for a region
> *HAVING* configured heap-lru-limit=10 in the client-cache.xml for the region 
> region
> *HAVING* configured heap-lru-delta=10 in the client-cache.xml for the region 
> region
> *HAVING* configured subscription-notification for the pool on which the 
> region is defined
> *HAVING* regsitered interest on all the keys of this region, values included
> *AFTER* receiving lots of LOCA_CREATE and LOCAL_DESTROY notifications
> *THEN* memory increases continously over time, even going over the LRU limit.
> Find massif tool report as massif.out.8419 showing the memory increase.
> Also this is a capture of massif-visualizer for the report:
> !massif-8419.png!



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8623) Timing between DNS and Geode startup can result in permanent unknown host exceptions.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233090#comment-17233090
 ] 

ASF GitHub Bot commented on GEODE-8623:
---

pivotal-jbarrett commented on a change in pull request #5743:
URL: https://github.com/apache/geode/pull/5743#discussion_r524600339



##
File path: geode-common/src/main/java/org/apache/geode/internal/Retry.java
##
@@ -47,45 +47,44 @@ public void sleep(long sleepTime, TimeUnit sleepTimeUnit) 
throws InterruptedExce
 }
   }
 
-  private static final SteadyClock steadyClock = new SteadyClock();
+  private static final SteadyTimer steadyClock = new SteadyTimer();
 
   /**
* Try the supplier function until the predicate is true or timeout occurs.
*
* @param timeout to retry for
+   * @param timeoutUnit the unit for timeout
* @param interval time between each try
-   * @param timeUnit to retry for
+   * @param intervalUnit the unit for interval
* @param supplier to execute until predicate is true or times out
* @param predicate to test for retry
* @param  type of return value
* @return value from supplier after it passes predicate or times out.
*/
-  public static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  public static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate) throws TimeoutException, InterruptedException {
-return tryFor(timeout, interval, timeUnit, supplier, predicate, 
steadyClock);
+return tryFor(timeout, timeoutUnit, interval, intervalUnit, supplier, 
predicate, steadyClock);
   }
 
   @VisibleForTesting
-  static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate,
-  Clock clock) throws TimeoutException, InterruptedException {
-long until = clock.nanoTime() + NANOSECONDS.convert(timeout, timeUnit);
+  Timer timer) throws TimeoutException, InterruptedException {
+long until = timer.nanoTime() + NANOSECONDS.convert(timeout, timeoutUnit);
 
 T value;
 do {
   value = supplier.get();
   if (predicate.test(value)) {
 return value;
   } else {
-clock.sleep(interval, timeUnit);
+timer.sleep(interval, intervalUnit);

Review comment:
   The last iteration is the problem, why sleep to delay the return. Either 
just return because it would timeout at its regular interval or reduce the 
interval and try again before timing out. Either way there is no point I 
delaying the results in the last iteration as this implementation does.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Timing between DNS and Geode startup can result in permanent unknown host 
> exceptions.
> -
>
> Key: GEODE-8623
> URL: https://issues.apache.org/jira/browse/GEODE-8623
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.9.0, 1.9.1, 1.10.0, 1.9.2, 1.11.0, 1.12.0, 1.13.0, 
> 1.14.0, 1.13.1
>Reporter: Jacob Barrett
>Priority: Minor
>  Labels: pull-request-available
>
> In a managed environment were local host name DNS entries and the startup of 
> Geode happen concurrently it is possible for Geode to fail name resolution in 
> the local hostname caching. If it fails to resolve the local hostname when 
> loading the caching utility class then any service dependent on this name 
> will fail without chance for recovery.
> {code}
> [error 2020/09/30 19:50:21.644 UTC  tid=0x1] Jmx manager could not be 
> started because java.net.UnknownHostException
> org.apache.geode.management.ManagementException: java.net.UnknownHostException
>   at 
> org.apache.geode.management.internal.ManagementAgent.startAgent(ManagementAgent.java:133)
>   at 
> org.apache.geode.management.internal.SystemManagementService.startManager(SystemManagementService.java:432)
>   at 
> org.apache.geode.management.internal.beans.ManagementAdapter.handleCacheCreation(ManagementAdapter.java:181)
>   at 
> org.apache.geode.management.internal.beans.ManagementListener.handleEvent(ManagementListener.java:127)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.notifyResourceEventListeners(InternalDistributedSystem.java:2063)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.handleResourceEv

[jira] [Commented] (GEODE-8693) C++ native client Function.execute() with onServers does not throw exception if one of the servers goes down while executing the function.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233099#comment-17233099
 ] 

ASF GitHub Bot commented on GEODE-8693:
---

echobravopapa commented on a change in pull request #690:
URL: https://github.com/apache/geode-native/pull/690#discussion_r524592099



##
File path: cppcache/src/ThinClientPoolDM.cpp
##
@@ -2414,7 +2415,7 @@ GfErrType FunctionExecution::execute() {
   m_error = m_poolDM->sendRequestToEP(request, reply, m_ep);
   m_error = m_poolDM->handleEPError(m_ep, reply, m_error);
   if (m_error != GF_NOERR) {
-if (m_error == GF_NOTCON || m_error == GF_IOERR) {
+if (m_error == GF_NOTCON) {

Review comment:
   how did you determine we no longer need to check for`GF_IOERR`?

##
File path: cppcache/src/ThinClientPoolDM.cpp
##
@@ -670,11 +670,12 @@ GfErrType ThinClientPoolDM::sendRequestToAllServers(
 err == GF_NOT_AUTHORIZED_EXCEPTION ||
 err == GF_AUTHENTICATION_REQUIRED_EXCEPTION) {
   finalErrorReturn = err;
-} else if (!(finalErrorReturn == GF_AUTHENTICATION_FAILED_EXCEPTION ||
- finalErrorReturn == GF_NOT_AUTHORIZED_EXCEPTION ||
- finalErrorReturn ==
- GF_AUTHENTICATION_REQUIRED_EXCEPTION))  // returning auth
- // errors
+} else if ((err != GF_NOERR) &&

Review comment:
   the check for `!GF_NOERR` looks like a good addition





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> C++ native client Function.execute() with onServers does not throw exception 
> if one of the servers goes down while executing the function.
> --
>
> Key: GEODE-8693
> URL: https://issues.apache.org/jira/browse/GEODE-8693
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Alberto Gomez
>Assignee: Alberto Gomez
>Priority: Major
>  Labels: pull-request-available
>
> According to the Apache Geode Native C++ API documentation, the 
> FunctionService.onServers() function will throw an Exception if one of the 
> servers goes down while dispatching or executing the function on the server.
> Nevertheless, currently no exception is thrown in that case.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8623) Timing between DNS and Geode startup can result in permanent unknown host exceptions.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233102#comment-17233102
 ] 

ASF GitHub Bot commented on GEODE-8623:
---

jinmeiliao commented on a change in pull request #5743:
URL: https://github.com/apache/geode/pull/5743#discussion_r524615847



##
File path: geode-common/src/main/java/org/apache/geode/internal/Retry.java
##
@@ -47,45 +47,44 @@ public void sleep(long sleepTime, TimeUnit sleepTimeUnit) 
throws InterruptedExce
 }
   }
 
-  private static final SteadyClock steadyClock = new SteadyClock();
+  private static final SteadyTimer steadyClock = new SteadyTimer();
 
   /**
* Try the supplier function until the predicate is true or timeout occurs.
*
* @param timeout to retry for
+   * @param timeoutUnit the unit for timeout
* @param interval time between each try
-   * @param timeUnit to retry for
+   * @param intervalUnit the unit for interval
* @param supplier to execute until predicate is true or times out
* @param predicate to test for retry
* @param  type of return value
* @return value from supplier after it passes predicate or times out.
*/
-  public static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  public static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate) throws TimeoutException, InterruptedException {
-return tryFor(timeout, interval, timeUnit, supplier, predicate, 
steadyClock);
+return tryFor(timeout, timeoutUnit, interval, intervalUnit, supplier, 
predicate, steadyClock);
   }
 
   @VisibleForTesting
-  static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate,
-  Clock clock) throws TimeoutException, InterruptedException {
-long until = clock.nanoTime() + NANOSECONDS.convert(timeout, timeUnit);
+  Timer timer) throws TimeoutException, InterruptedException {
+long until = timer.nanoTime() + NANOSECONDS.convert(timeout, timeoutUnit);
 
 T value;
 do {
   value = supplier.get();
   if (predicate.test(value)) {
 return value;
   } else {
-clock.sleep(interval, timeUnit);
+timer.sleep(interval, intervalUnit);

Review comment:
   what I am saying is that even it reduces the interval it wouldn't try 
again, it will always just throw TimeoutException in the end.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Timing between DNS and Geode startup can result in permanent unknown host 
> exceptions.
> -
>
> Key: GEODE-8623
> URL: https://issues.apache.org/jira/browse/GEODE-8623
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.9.0, 1.9.1, 1.10.0, 1.9.2, 1.11.0, 1.12.0, 1.13.0, 
> 1.14.0, 1.13.1
>Reporter: Jacob Barrett
>Priority: Minor
>  Labels: pull-request-available
>
> In a managed environment were local host name DNS entries and the startup of 
> Geode happen concurrently it is possible for Geode to fail name resolution in 
> the local hostname caching. If it fails to resolve the local hostname when 
> loading the caching utility class then any service dependent on this name 
> will fail without chance for recovery.
> {code}
> [error 2020/09/30 19:50:21.644 UTC  tid=0x1] Jmx manager could not be 
> started because java.net.UnknownHostException
> org.apache.geode.management.ManagementException: java.net.UnknownHostException
>   at 
> org.apache.geode.management.internal.ManagementAgent.startAgent(ManagementAgent.java:133)
>   at 
> org.apache.geode.management.internal.SystemManagementService.startManager(SystemManagementService.java:432)
>   at 
> org.apache.geode.management.internal.beans.ManagementAdapter.handleCacheCreation(ManagementAdapter.java:181)
>   at 
> org.apache.geode.management.internal.beans.ManagementListener.handleEvent(ManagementListener.java:127)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.notifyResourceEventListeners(InternalDistributedSystem.java:2063)
>   at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.handleResourceEvent(InternalDistributedSystem.java:606)
>   at 
> org.apache.geode.internal.cache.GemFireCacheImpl.initialize(GemFireCacheImpl.java:1239)
>   at 
> org.apache.ge

[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233108#comment-17233108
 ] 

ASF GitHub Bot commented on GEODE-8706:
---

gesterzhou merged pull request #5750:
URL: https://github.com/apache/geode/pull/5750


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233109#comment-17233109
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 399b875b493060d4acff0d1b41bdb37401181439 in geode's branch 
refs/heads/develop from Xiaojian Zhou
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=399b875 ]

GEODE-8706: getConnection should get readLock to sync with destroyCon… (#5750)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread Xiaojian Zhou (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiaojian Zhou resolved GEODE-8706.
--
Fix Version/s: 1.14.0
   Resolution: Fixed

> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8623) Timing between DNS and Geode startup can result in permanent unknown host exceptions.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233113#comment-17233113
 ] 

ASF GitHub Bot commented on GEODE-8623:
---

Bill commented on a change in pull request #5743:
URL: https://github.com/apache/geode/pull/5743#discussion_r524634772



##
File path: geode-common/src/main/java/org/apache/geode/internal/Retry.java
##
@@ -47,45 +47,44 @@ public void sleep(long sleepTime, TimeUnit sleepTimeUnit) 
throws InterruptedExce
 }
   }
 
-  private static final SteadyClock steadyClock = new SteadyClock();
+  private static final SteadyTimer steadyClock = new SteadyTimer();
 
   /**
* Try the supplier function until the predicate is true or timeout occurs.
*
* @param timeout to retry for
+   * @param timeoutUnit the unit for timeout
* @param interval time between each try
-   * @param timeUnit to retry for
+   * @param intervalUnit the unit for interval
* @param supplier to execute until predicate is true or times out
* @param predicate to test for retry
* @param  type of return value
* @return value from supplier after it passes predicate or times out.
*/
-  public static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  public static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate) throws TimeoutException, InterruptedException {
-return tryFor(timeout, interval, timeUnit, supplier, predicate, 
steadyClock);
+return tryFor(timeout, timeoutUnit, interval, intervalUnit, supplier, 
predicate, steadyClock);
   }
 
   @VisibleForTesting
-  static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate,
-  Clock clock) throws TimeoutException, InterruptedException {
-long until = clock.nanoTime() + NANOSECONDS.convert(timeout, timeUnit);
+  Timer timer) throws TimeoutException, InterruptedException {
+long until = timer.nanoTime() + NANOSECONDS.convert(timeout, timeoutUnit);
 
 T value;
 do {
   value = supplier.get();
   if (predicate.test(value)) {
 return value;
   } else {
-clock.sleep(interval, timeUnit);
+timer.sleep(interval, intervalUnit);

Review comment:
   The example @jinmeiliao gave covers use-cases where the number of 
iterations is potentially large. Those are cases where time to run predicate + 
time to run supplier + sleep _interval_ << _timeout_.
   
   There are other use-cases where that relation does not hold. In those cases 
perhaps we will only execute the body of the loop once. Imagine time to run 
predicate + time to run supplier + sleep _interval_ === _timeout_ i.e. the 
timeout interval is roughly of the same magnitude as the other work to be done 
each time through the loop. In that case, @pivotal-jbarrett's concern about 
overshooting the timeout seem more important. This second scenario seems to me 
to be worth the effort of the added complexity of calculating `sleepTime = 
min(interval, until - now)` and returning immediately if that's 
less-than-or-equal-to zero.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Timing between DNS and Geode startup can result in permanent unknown host 
> exceptions.
> -
>
> Key: GEODE-8623
> URL: https://issues.apache.org/jira/browse/GEODE-8623
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.9.0, 1.9.1, 1.10.0, 1.9.2, 1.11.0, 1.12.0, 1.13.0, 
> 1.14.0, 1.13.1
>Reporter: Jacob Barrett
>Priority: Minor
>  Labels: pull-request-available
>
> In a managed environment were local host name DNS entries and the startup of 
> Geode happen concurrently it is possible for Geode to fail name resolution in 
> the local hostname caching. If it fails to resolve the local hostname when 
> loading the caching utility class then any service dependent on this name 
> will fail without chance for recovery.
> {code}
> [error 2020/09/30 19:50:21.644 UTC  tid=0x1] Jmx manager could not be 
> started because java.net.UnknownHostException
> org.apache.geode.management.ManagementException: java.net.UnknownHostException
>   at 
> org.apache.geode.management.internal.ManagementAgent.startAgent(ManagementAgent.java:133)
>   at 
> org.apache.geode.management.internal.SystemManagem

[jira] [Commented] (GEODE-8623) Timing between DNS and Geode startup can result in permanent unknown host exceptions.

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233114#comment-17233114
 ] 

ASF GitHub Bot commented on GEODE-8623:
---

Bill commented on a change in pull request #5743:
URL: https://github.com/apache/geode/pull/5743#discussion_r524634772



##
File path: geode-common/src/main/java/org/apache/geode/internal/Retry.java
##
@@ -47,45 +47,44 @@ public void sleep(long sleepTime, TimeUnit sleepTimeUnit) 
throws InterruptedExce
 }
   }
 
-  private static final SteadyClock steadyClock = new SteadyClock();
+  private static final SteadyTimer steadyClock = new SteadyTimer();
 
   /**
* Try the supplier function until the predicate is true or timeout occurs.
*
* @param timeout to retry for
+   * @param timeoutUnit the unit for timeout
* @param interval time between each try
-   * @param timeUnit to retry for
+   * @param intervalUnit the unit for interval
* @param supplier to execute until predicate is true or times out
* @param predicate to test for retry
* @param  type of return value
* @return value from supplier after it passes predicate or times out.
*/
-  public static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  public static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate) throws TimeoutException, InterruptedException {
-return tryFor(timeout, interval, timeUnit, supplier, predicate, 
steadyClock);
+return tryFor(timeout, timeoutUnit, interval, intervalUnit, supplier, 
predicate, steadyClock);
   }
 
   @VisibleForTesting
-  static  T tryFor(long timeout,
-  long interval,
-  TimeUnit timeUnit,
+  static  T tryFor(long timeout, TimeUnit timeoutUnit,
+  long interval, TimeUnit intervalUnit,
   Supplier supplier,
   Predicate predicate,
-  Clock clock) throws TimeoutException, InterruptedException {
-long until = clock.nanoTime() + NANOSECONDS.convert(timeout, timeUnit);
+  Timer timer) throws TimeoutException, InterruptedException {
+long until = timer.nanoTime() + NANOSECONDS.convert(timeout, timeoutUnit);
 
 T value;
 do {
   value = supplier.get();
   if (predicate.test(value)) {
 return value;
   } else {
-clock.sleep(interval, timeUnit);
+timer.sleep(interval, intervalUnit);

Review comment:
   The example @jinmeiliao gave covers use-cases where the number of 
iterations is potentially large. Those are cases where time to run predicate + 
time to run supplier + sleep _interval_ << _timeout_.
   
   There are other use-cases where that relation does not hold. In those cases 
perhaps we will only execute the body of the loop once. Imagine time to run 
predicate + time to run supplier + sleep _interval_ (=roughly-equal-to=) 
_timeout_ i.e. the timeout interval is roughly of the same magnitude as the 
other work to be done each time through the loop. In that case, 
@pivotal-jbarrett's concern about overshooting the timeout seem more important. 
This second scenario seems to me to be worth the effort of the added complexity 
of calculating `sleepTime = min(interval, until - now)` and returning 
immediately if that's less-than-or-equal-to zero.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Timing between DNS and Geode startup can result in permanent unknown host 
> exceptions.
> -
>
> Key: GEODE-8623
> URL: https://issues.apache.org/jira/browse/GEODE-8623
> Project: Geode
>  Issue Type: Bug
>Affects Versions: 1.9.0, 1.9.1, 1.10.0, 1.9.2, 1.11.0, 1.12.0, 1.13.0, 
> 1.14.0, 1.13.1
>Reporter: Jacob Barrett
>Priority: Minor
>  Labels: pull-request-available
>
> In a managed environment were local host name DNS entries and the startup of 
> Geode happen concurrently it is possible for Geode to fail name resolution in 
> the local hostname caching. If it fails to resolve the local hostname when 
> loading the caching utility class then any service dependent on this name 
> will fail without chance for recovery.
> {code}
> [error 2020/09/30 19:50:21.644 UTC  tid=0x1] Jmx manager could not be 
> started because java.net.UnknownHostException
> org.apache.geode.management.ManagementException: java.net.UnknownHostException
>   at 
> org.apache.geode.management.internal.ManagementAgent.startAgent(ManagementAgent.java:133)
>   at 
> org.apache.geode.management.inter

[jira] [Commented] (GEODE-8702) Add StringPrefixPartitionResolver

2020-11-16 Thread Ernest Burghardt (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233119#comment-17233119
 ] 

Ernest Burghardt commented on GEODE-8702:
-

[~gaussianrecurrence]  I'm curious what the use case for this is aside from 
parity with Java API...

> Add StringPrefixPartitionResolver
> -
>
> Key: GEODE-8702
> URL: https://issues.apache.org/jira/browse/GEODE-8702
> Project: Geode
>  Issue Type: New Feature
>  Components: native client
>Affects Versions: 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *AS A* native client contributor
> *I WANT* to have StringPrefixPartitionResolver implemented
> *SO THAT* so I can just use it as for the Java API is done



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8694) Use fork of Redis in Gemfire org to run Redis tests in CI

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233117#comment-17233117
 ] 

ASF GitHub Bot commented on GEODE-8694:
---

nonbinaryprogrammer commented on a change in pull request #5732:
URL: https://github.com/apache/geode/pull/5732#discussion_r524645049



##
File path: ci/scripts/execute_redis_tests.sh
##
@@ -19,12 +19,14 @@
 
 cd ..
 
-# We are currently using a personal fork for this repo because our code does 
not implement all
+# We are currently using a patched version of this repo because our code does 
not implement all
 # Redis commands.  Once all commands needed to run relevant test files are 
implemented, we hope to
-# use Redis's repo instead.
-git clone --config transfer.fsckObjects=false 
https://github.com/prettyClouds/redis.git
+# use Redis's repo without a patch.
+git clone --config transfer.fsckObjects=false 
https://github.com/redis/redis.git
+cp geode-redis/src/acceptanceTest/resources/0001-configure-redis-tests.patch 
redis
 cd redis
-git checkout tests-geode-redis
+git checkout origin/5.0
+git am < 0001-configure-redis-tests.patch

Review comment:
   thanks @sabbey37! I'll do some more testing on that





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Use fork of Redis in Gemfire org to run Redis tests in CI
> -
>
> Key: GEODE-8694
> URL: https://issues.apache.org/jira/browse/GEODE-8694
> Project: Geode
>  Issue Type: Improvement
>  Components: redis
>Reporter: Sarah Abbey
>Assignee: Helena Bales
>Priority: Trivial
>  Labels: pull-request-available
>
> Currently, we are using a contributor's personal fork of Redis to run tests 
> against Geode Redis in CI.  This should be switched to the fork of Redis in 
> the Gemfire org.  Ideally, we will eventually be able to run tests from the 
> root Redis repo (currently we are unable to since we are not implementing all 
> the necessary commands).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (GEODE-8702) Add StringPrefixPartitionResolver

2020-11-16 Thread Mario Salazar de Torres (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233129#comment-17233129
 ] 

Mario Salazar de Torres edited comment on GEODE-8702 at 11/16/20, 10:09 PM:


Hi [~echobravo],

Mainly parity with Java API.
 Basically, if you can specify a PartitionResolver in the server side, it makes 
sense that this very PartitionResolver is available in all the client APIs.
 Otherwise, each user of geode-native will have to maintain its own 
implementation of StringPrefixPartitionResolver.


was (Author: gaussianrecurrence):
Hi [~echobravo],

Mainly parity with Java API.
Basically, if you can specify a PartitionResolver in the server side, it makes 
sense that this very PartitionResolver is available in all the client APIs.
Otherwise, each user of geode-native will have to maintain its own 
implementation of StringPrefixPartitionResolver.

BR,
Mario.

> Add StringPrefixPartitionResolver
> -
>
> Key: GEODE-8702
> URL: https://issues.apache.org/jira/browse/GEODE-8702
> Project: Geode
>  Issue Type: New Feature
>  Components: native client
>Affects Versions: 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *AS A* native client contributor
> *I WANT* to have StringPrefixPartitionResolver implemented
> *SO THAT* so I can just use it as for the Java API is done



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8702) Add StringPrefixPartitionResolver

2020-11-16 Thread Mario Salazar de Torres (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233129#comment-17233129
 ] 

Mario Salazar de Torres commented on GEODE-8702:


Hi [~echobravo],

Mainly parity with Java API.
Basically, if you can specify a PartitionResolver in the server side, it makes 
sense that this very PartitionResolver is available in all the client APIs.
Otherwise, each user of geode-native will have to maintain its own 
implementation of StringPrefixPartitionResolver.

BR,
Mario.

> Add StringPrefixPartitionResolver
> -
>
> Key: GEODE-8702
> URL: https://issues.apache.org/jira/browse/GEODE-8702
> Project: Geode
>  Issue Type: New Feature
>  Components: native client
>Affects Versions: 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Major
>  Labels: pull-request-available
>
> *AS A* native client contributor
> *I WANT* to have StringPrefixPartitionResolver implemented
> *SO THAT* so I can just use it as for the Java API is done



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (GEODE-8713) Geode-Native .NET user guide: API reference links point to C++ API

2020-11-16 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-8713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated GEODE-8713:
--
Labels: pull-request-available  (was: )

> Geode-Native .NET user guide: API reference links point to C++ API
> --
>
> Key: GEODE-8713
> URL: https://issues.apache.org/jira/browse/GEODE-8713
> Project: Geode
>  Issue Type: Bug
>  Components: docs
>Affects Versions: 1.13.0
>Reporter: Dave Barnes
>Assignee: Dave Barnes
>Priority: Major
>  Labels: pull-request-available
>
> The "Client Cache XML Reference" section in the .NET user guide contains 
> links to "API Class Reference" that all point to the C++ API ref. These 
> should point to the .NET API ref.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8713) Geode-Native .NET user guide: API reference links point to C++ API

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233159#comment-17233159
 ] 

ASF GitHub Bot commented on GEODE-8713:
---

davebarnes97 merged pull request #694:
URL: https://github.com/apache/geode-native/pull/694


   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Geode-Native .NET user guide: API reference links point to C++ API
> --
>
> Key: GEODE-8713
> URL: https://issues.apache.org/jira/browse/GEODE-8713
> Project: Geode
>  Issue Type: Bug
>  Components: docs
>Affects Versions: 1.13.0
>Reporter: Dave Barnes
>Assignee: Dave Barnes
>Priority: Major
>
> The "Client Cache XML Reference" section in the .NET user guide contains 
> links to "API Class Reference" that all point to the C++ API ref. These 
> should point to the .NET API ref.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8442) Exception in server not identified correctly in client

2020-11-16 Thread ASF GitHub Bot (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233305#comment-17233305
 ] 

ASF GitHub Bot commented on GEODE-8442:
---

mkevo commented on pull request #693:
URL: https://github.com/apache/geode-native/pull/693#issuecomment-728731731


   I'm trying to write a test for this, as I reproduce it manually. There are 
steps to reproduce the issue:
   
   1. Start locator and three servers.
   2. Run some puts and then run containsKeyOnServer function with transactions.
   3. While containsKeyOnServer is executing kill lead server.
   4. You will got UnknownException without this change.
   
   So if you have some idea how to do this containsKeyOnServer and killing 
server lead in parallel, it will help a lot.
   @echobravopapa , yes, I intend this to be a draft.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Exception in server not identified correctly in client
> --
>
> Key: GEODE-8442
> URL: https://issues.apache.org/jira/browse/GEODE-8442
> Project: Geode
>  Issue Type: Bug
>  Components: native client
>Reporter: Alberto Bustamante Reyes
>Assignee: Mario Kevo
>Priority: Major
>  Labels: pull-request-available
>
> Native client is not identifying correctly an exception returned by the 
> server.
> This is a log from an exception properly identified (I have introduced "" 
> where I think I have to hide sensitive information):
> {code}
> [error 2020/07/22 09:17:10.831337 UTC ] 
> CacheTransactionManager::failover: An exception 
> (org.apache.geode.cache.TransactionDataNodeHasDepartedException: Could not 
> find transaction host for TXId: 
> (default_GeodeDS:1:loner):3:Native_gdbdedfebe1:default_GeodeDS:6022
>   at 
> org.apache.geode.internal.cache.tier.sockets.command.TXFailoverCommand.cmdExecute(TXFailoverCommand.java:126)
>   at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:177)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.doNormalMessage(ServerConnection.java:848)
>   at 
> org.apache.geode.internal.cache.tier.sockets.OriginalServerConnection.doOneMessage(OriginalServerConnection.java:72)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerConnection.run(ServerConnection.java:1212)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
>   at 
> java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
>   at 
> org.apache.geode.internal.cache.tier.sockets.AcceptorImpl.lambda$initializeServerConnectionThreadPool$3(AcceptorImpl.java:676)
>   at 
> org.apache.geode.logging.internal.executors.LoggingThreadFactory.lambda$newThread$0(LoggingThreadFactory.java:119)
>   at java.base/java.lang.Thread.run(Thread.java:834)
> ) happened at remote server.
> data-access: write_element:[TransactionDataNodeHasDepartedException] 
> .get()::apache::geode::client::TransactionDataNodeHasDepartedException:org.apache.geode.cache.TransactionDataNodeHasDepartedException:
>  Could not find transaction host for TXId: 
> (default_GeodeDS:1:loner):3:Native_gdbdedfebe1:default_GeodeDS:6022
> {code}
> But in this case, although the exception received is also a 
> TransactionDataNodeHasDepartedException, it is "translated" to an Unknown 
> exception:
> {code}
> [error 2020/07/22 09:17:10.979506 UTC ] Region::containsKeyOnServer:: An 
> exception (org.apache.geode.cache.TransactionDataNodeHasDepartedException: 
> Transaction data node 192.168.240.13(dms-server-1:1):41000 has departed. 
> To proceed, rollback this transaction and begin a new one., caused by 
> org.apache.geode.internal.cache.ForceReattemptException: PartitionResponse 
> got remote CacheClosedException
>   at 
> org.apache.geode.internal.cache.tx.PartitionedTXRegionStub.containsKey(PartitionedTXRegionStub.java:247)
>   at 
> org.apache.geode.internal.cache.TXStateStub.containsKey(TXStateStub.java:431)
>   at 
> org.apache.geode.internal.cache.TXStateProxyImpl.containsKey(TXStateProxyImpl.java:530)
>   at 
> org.apache.geode.internal.cache.PartitionedRegion.containsKey(PartitionedRegion.java:6402)
>   at 
> org.apache.geode.internal.cache.tier.sockets.command.ContainsKey66.cmdExecute(ContainsKey66.java:138)
>   at 
> org.apache.geode.internal.cache.tier.sockets.BaseCommand.execute(BaseCommand.java:177)
>   at 
> org.apache.geode.internal.cache.tier.sockets.ServerCo

[jira] [Commented] (GEODE-8692) ArrayIndexOutOfBoundsException may be thrown in RegionAdvisor.processProfilesQueuedDuringInitialization

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233311#comment-17233311
 ] 

ASF subversion and git services commented on GEODE-8692:


Commit c99087aeb19abfb5bbd57036349870a6d784df1a in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=c99087a ]

GEODE-8692: ArrayIndexOutOfBoundsException may be thrown in 
RegionAdvisor.processProfilesQueuedDuringInitialization (#5722)



> ArrayIndexOutOfBoundsException may be thrown in 
> RegionAdvisor.processProfilesQueuedDuringInitialization
> ---
>
> Key: GEODE-8692
> URL: https://issues.apache.org/jira/browse/GEODE-8692
> Project: Geode
>  Issue Type: Bug
>Reporter: Sarah Abbey
>Assignee: Sarah Abbey
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> If {{RegionAdvisor.buckets == null}} at the time the value of {{serials}} is 
> generated, an {{ArrayIndexOutOfBoundsException}} may be thrown in 
> {{RegionAdvisor.processProfilesQueuedDuringInitialization}} if buckets have 
> been initialized at that point.
> Code where {{ArrayIndexOutOfBoundsException}} might be thrown in 
> {{RegionAdvisor.processProfilesQueuedDuringInitialization():}}
> {code:java}
> for (int i = 0; i < buckets.length; i++) {
>   BucketAdvisor ba = buckets[i].getBucketAdvisor();
>   int serial = qbp.serials[i]; <<< Exception thrown here
>   if (serial != ILLEGAL_SERIAL) {
> ba.removeIdWithSerial(qbp.memberId, serial, qbp.destroyed);
>   }
> }
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8664) JGroups exception is not propagated

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233313#comment-17233313
 ] 

ASF subversion and git services commented on GEODE-8664:


Commit f23e01a32213245dc684b417d97a14cf5a026741 in geode's branch 
refs/heads/feature/GEODE-8444 from Mario Salazar de Torres
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=f23e01a ]

Revert "GEODE-8664: Nest errors in DistributionImpl.start (#5725)" (#5742)

This reverts commit f6605e0820ba9b858b8128cd31a03a29067b7710.

> JGroups exception is not propagated
> ---
>
> Key: GEODE-8664
> URL: https://issues.apache.org/jira/browse/GEODE-8664
> Project: Geode
>  Issue Type: Bug
>  Components: membership, messaging
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Minor
>  Labels: pull-request-available
> Attachments: GEODE-8664.patch
>
>
> *AS A* geode contributor
> *I WANT* exceptions thrown in DistributionImpl.start to be propagated
> *SO THAT* more information is provided while tackling issues.
>  
> 
> *Additional information:*
> After looking at an issue while starting a locator having to do with 
> JGroupMessenger I noticed that exceptions from Membership.start are not 
> correctly propagated in DistributionImpl.start method.
> To ilustrate the problem I attach below the reported exception while starting 
> the locator:
>  
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.GemFireConfigException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:184)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3033)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.java:623)
> at 
> org.apache.geode.distributed.LocatorLauncher.main(LocatorLauncher.java:217){noformat}
>  
> Thing is with that kind of information there's no way to know why the problem 
> is happening, so the idea would be to propagate the exceptions, so it looks 
> more like this:
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.SystemConnectException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:186)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.

[jira] [Commented] (GEODE-8704) many CI failures in Jetty9CachingClientServerTest

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233314#comment-17233314
 ] 

ASF subversion and git services commented on GEODE-8704:


Commit 8f8cc7b62be9e7d3a20f18e38fe2cff3228a55bb in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=8f8cc7b ]

GEODE-8704: many CI failures in Jetty9CachingClientServerTest (#5745)



> many CI failures in Jetty9CachingClientServerTest
> -
>
> Key: GEODE-8704
> URL: https://issues.apache.org/jira/browse/GEODE-8704
> Project: Geode
>  Issue Type: Bug
>  Components: http session
>Affects Versions: 1.14.0
>Reporter: Kamilla Aslami
>Assignee: Benjamin P Ross
>Priority: Major
>  Labels: pull-request-available
>
> A bunch of failures:
> {code:java}
> org.apache.geode.session.tests.Jetty9CachingClientServerTest > 
> shouldCacheSessionOnClient FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in VM 1 running 
> on Host e4bd15534025 with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:434)
> at 
> org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:102)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.shouldCacheSessionOnClient(Jetty9CachingClientServerTest.java:57)
> Caused by:
> java.lang.IllegalStateException: Session is invalid
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.checkValid(GemfireHttpSession.java:317)
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.setAttribute(GemfireHttpSession.java:301)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$null$0(Jetty9CachingClientServerTest.java:60)
> at java.lang.Iterable.forEach(Iterable.java:75)
> at 
> java.util.Collections$UnmodifiableCollection.forEach(Collections.java:1085)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$shouldCacheSessionOnClient$6b5e8371$1(Jetty9CachingClientServerTest.java:60)org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > shouldNotLeaveNativeSessionInContainer FAILED
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldReplicateCookies FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> 
> java.util.concurrent.TimeoutExceptionorg.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldShareSessionExpirationReset FAILED
> org.junit.ComparisonFailure: 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldExpireInSetTimeframe FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldHavePersistentSessionData FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > failureShouldStillAllowOtherContainersDataAccess FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > invalidationShouldRemoveValueAccessForAllContainers FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in

[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233321#comment-17233321
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 399b875b493060d4acff0d1b41bdb37401181439 in geode's branch 
refs/heads/feature/GEODE-8444 from Xiaojian Zhou
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=399b875 ]

GEODE-8706: getConnection should get readLock to sync with destroyCon… (#5750)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233319#comment-17233319
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterrup

[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233316#comment-17233316
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233318#comment-17233318
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterrup

[jira] [Commented] (GEODE-8672) Concurrent transactional destroy with GII could cause an entry to be removed and version information to be lost

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233315#comment-17233315
 ] 

ASF subversion and git services commented on GEODE-8672:


Commit 7a55cba21cbe1fd22668c014a8a96ff161a75820 in geode's branch 
refs/heads/feature/GEODE-8444 from Eric Shu
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=7a55cba ]

GEODE-8672: No need in token mode if concurrencyChecksEnabled (#5746)

  * The DESTROYED token is only needed to prevent concurrent destroy op
is lost in GII. If concurrency checks are enabled, the version tag
should be able to prevent the destroy op being lost.
  * Correctly set the inTokenMode to avoid a potential hang in replicate
region as invokeTXCallbacks needs to wait for the region initialization
finished, but region intialization could not due to lock held by
threads invoking callbacks.


> Concurrent transactional destroy with GII could cause an entry to be removed 
> and version information to be lost
> ---
>
> Key: GEODE-8672
> URL: https://issues.apache.org/jira/browse/GEODE-8672
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Affects Versions: 1.1.0
>Reporter: Eric Shu
>Assignee: Eric Shu
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> In a newly rebalanced bucket, while GII is in progress, a transactional 
> destroy is applied to cache. There is a logic that it should be in token mode 
> and leaves the entry as a Destroyed token, even though the version tag of the 
> entry indicates that it has the correct version.
> However, at end of the GII, there is a 
> cleanUpDestroyedTokensAndMarkGIIComplete method removes all the destroyed 
> entries – this wipes off the entry version tag information and cause the 
> subsequent creates starts fresh with new version tags.
> This could leads to client server data inconsistency as the newly created 
> entries will be ignored by the clients as the newly created entry has lower 
> version number while client has high ones.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233317#comment-17233317
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233320#comment-17233320
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 1bf18b2e9d3bee8c75ec2acb2331a6c12cc886b6 in geode's branch 
refs/heads/feature/GEODE-8444 from John Hutchison
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=1bf18b2 ]

GEODE-8706: Redis INFO command 'Keyspace' section should not be present if no 
keys in database (#5753)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8444) When requester's RVV equals provider's rvvGC, should do fullGII

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233322#comment-17233322
 ] 

ASF subversion and git services commented on GEODE-8444:


Commit 86d34d8f6c631580ce7d0e69ef6c9c3be8b7dfd0 in geode's branch 
refs/heads/feature/GEODE-8444 from zhouxh
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=86d34d8 ]

GEODE-8444: When requester's RVV equals provider's rvvGC, should do fullGII


> When requester's RVV equals provider's rvvGC, should do fullGII 
> 
>
> Key: GEODE-8444
> URL: https://issues.apache.org/jira/browse/GEODE-8444
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> In current design, when requester's RVV equals provider's rvvGC, we treated 
> as dominated. 
> But if the last operation at provider is a destroy and the tombstone is GCed 
> when requester is offline, the provider lost the information about this key 
> when GII. So we should do fullGII instead of deltaGII in this case. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8664) JGroups exception is not propagated

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233312#comment-17233312
 ] 

ASF subversion and git services commented on GEODE-8664:


Commit f6605e0820ba9b858b8128cd31a03a29067b7710 in geode's branch 
refs/heads/feature/GEODE-8444 from Mario Salazar de Torres
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=f6605e0 ]

GEODE-8664: Nest errors in DistributionImpl.start (#5725)

 - If while calling DistributionImpl.start there was either a
   MembershipConfigurationException or MemberStartupException exceptions, its
   original cause was not propagated, therefore being unable to properly
   tackle issues on startup.
 - This commit propagates both exceptions.
 - Also 2 junit test were added to make sure this is working.


> JGroups exception is not propagated
> ---
>
> Key: GEODE-8664
> URL: https://issues.apache.org/jira/browse/GEODE-8664
> Project: Geode
>  Issue Type: Bug
>  Components: membership, messaging
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Minor
>  Labels: pull-request-available
> Attachments: GEODE-8664.patch
>
>
> *AS A* geode contributor
> *I WANT* exceptions thrown in DistributionImpl.start to be propagated
> *SO THAT* more information is provided while tackling issues.
>  
> 
> *Additional information:*
> After looking at an issue while starting a locator having to do with 
> JGroupMessenger I noticed that exceptions from Membership.start are not 
> correctly propagated in DistributionImpl.start method.
> To ilustrate the problem I attach below the reported exception while starting 
> the locator:
>  
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.GemFireConfigException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:184)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3033)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.java:623)
> at 
> org.apache.geode.distributed.LocatorLauncher.main(LocatorLauncher.java:217){noformat}
>  
> Thing is with that kind of information there's no way to know why the problem 
> is happening, so the idea would be to propagate the exceptions, so it looks 
> more like this:
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.SystemConnectException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:186)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocat

[jira] [Commented] (GEODE-8664) JGroups exception is not propagated

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233329#comment-17233329
 ] 

ASF subversion and git services commented on GEODE-8664:


Commit f6605e0820ba9b858b8128cd31a03a29067b7710 in geode's branch 
refs/heads/feature/GEODE-8444 from Mario Salazar de Torres
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=f6605e0 ]

GEODE-8664: Nest errors in DistributionImpl.start (#5725)

 - If while calling DistributionImpl.start there was either a
   MembershipConfigurationException or MemberStartupException exceptions, its
   original cause was not propagated, therefore being unable to properly
   tackle issues on startup.
 - This commit propagates both exceptions.
 - Also 2 junit test were added to make sure this is working.


> JGroups exception is not propagated
> ---
>
> Key: GEODE-8664
> URL: https://issues.apache.org/jira/browse/GEODE-8664
> Project: Geode
>  Issue Type: Bug
>  Components: membership, messaging
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Minor
>  Labels: pull-request-available
> Attachments: GEODE-8664.patch
>
>
> *AS A* geode contributor
> *I WANT* exceptions thrown in DistributionImpl.start to be propagated
> *SO THAT* more information is provided while tackling issues.
>  
> 
> *Additional information:*
> After looking at an issue while starting a locator having to do with 
> JGroupMessenger I noticed that exceptions from Membership.start are not 
> correctly propagated in DistributionImpl.start method.
> To ilustrate the problem I attach below the reported exception while starting 
> the locator:
>  
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.GemFireConfigException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:184)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3033)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.java:623)
> at 
> org.apache.geode.distributed.LocatorLauncher.main(LocatorLauncher.java:217){noformat}
>  
> Thing is with that kind of information there's no way to know why the problem 
> is happening, so the idea would be to propagate the exceptions, so it looks 
> more like this:
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.SystemConnectException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:186)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocat

[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1725#comment-1725
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterrup

[jira] [Commented] (GEODE-8692) ArrayIndexOutOfBoundsException may be thrown in RegionAdvisor.processProfilesQueuedDuringInitialization

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233328#comment-17233328
 ] 

ASF subversion and git services commented on GEODE-8692:


Commit c99087aeb19abfb5bbd57036349870a6d784df1a in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=c99087a ]

GEODE-8692: ArrayIndexOutOfBoundsException may be thrown in 
RegionAdvisor.processProfilesQueuedDuringInitialization (#5722)



> ArrayIndexOutOfBoundsException may be thrown in 
> RegionAdvisor.processProfilesQueuedDuringInitialization
> ---
>
> Key: GEODE-8692
> URL: https://issues.apache.org/jira/browse/GEODE-8692
> Project: Geode
>  Issue Type: Bug
>Reporter: Sarah Abbey
>Assignee: Sarah Abbey
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> If {{RegionAdvisor.buckets == null}} at the time the value of {{serials}} is 
> generated, an {{ArrayIndexOutOfBoundsException}} may be thrown in 
> {{RegionAdvisor.processProfilesQueuedDuringInitialization}} if buckets have 
> been initialized at that point.
> Code where {{ArrayIndexOutOfBoundsException}} might be thrown in 
> {{RegionAdvisor.processProfilesQueuedDuringInitialization():}}
> {code:java}
> for (int i = 0; i < buckets.length; i++) {
>   BucketAdvisor ba = buckets[i].getBucketAdvisor();
>   int serial = qbp.serials[i]; <<< Exception thrown here
>   if (serial != ILLEGAL_SERIAL) {
> ba.removeIdWithSerial(qbp.memberId, serial, qbp.destroyed);
>   }
> }
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8672) Concurrent transactional destroy with GII could cause an entry to be removed and version information to be lost

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1722#comment-1722
 ] 

ASF subversion and git services commented on GEODE-8672:


Commit 7a55cba21cbe1fd22668c014a8a96ff161a75820 in geode's branch 
refs/heads/feature/GEODE-8444 from Eric Shu
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=7a55cba ]

GEODE-8672: No need in token mode if concurrencyChecksEnabled (#5746)

  * The DESTROYED token is only needed to prevent concurrent destroy op
is lost in GII. If concurrency checks are enabled, the version tag
should be able to prevent the destroy op being lost.
  * Correctly set the inTokenMode to avoid a potential hang in replicate
region as invokeTXCallbacks needs to wait for the region initialization
finished, but region intialization could not due to lock held by
threads invoking callbacks.


> Concurrent transactional destroy with GII could cause an entry to be removed 
> and version information to be lost
> ---
>
> Key: GEODE-8672
> URL: https://issues.apache.org/jira/browse/GEODE-8672
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Affects Versions: 1.1.0
>Reporter: Eric Shu
>Assignee: Eric Shu
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> In a newly rebalanced bucket, while GII is in progress, a transactional 
> destroy is applied to cache. There is a logic that it should be in token mode 
> and leaves the entry as a Destroyed token, even though the version tag of the 
> entry indicates that it has the correct version.
> However, at end of the GII, there is a 
> cleanUpDestroyedTokensAndMarkGIIComplete method removes all the destroyed 
> entries – this wipes off the entry version tag information and cause the 
> subsequent creates starts fresh with new version tags.
> This could leads to client server data inconsistency as the newly created 
> entries will be ignored by the clients as the newly created entry has lower 
> version number while client has high ones.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1723#comment-1723
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1726#comment-1726
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterrup

[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1727#comment-1727
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 1bf18b2e9d3bee8c75ec2acb2331a6c12cc886b6 in geode's branch 
refs/heads/feature/GEODE-8444 from John Hutchison
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=1bf18b2 ]

GEODE-8706: Redis INFO command 'Keyspace' section should not be present if no 
keys in database (#5753)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8704) many CI failures in Jetty9CachingClientServerTest

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1721#comment-1721
 ] 

ASF subversion and git services commented on GEODE-8704:


Commit 8f8cc7b62be9e7d3a20f18e38fe2cff3228a55bb in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=8f8cc7b ]

GEODE-8704: many CI failures in Jetty9CachingClientServerTest (#5745)



> many CI failures in Jetty9CachingClientServerTest
> -
>
> Key: GEODE-8704
> URL: https://issues.apache.org/jira/browse/GEODE-8704
> Project: Geode
>  Issue Type: Bug
>  Components: http session
>Affects Versions: 1.14.0
>Reporter: Kamilla Aslami
>Assignee: Benjamin P Ross
>Priority: Major
>  Labels: pull-request-available
>
> A bunch of failures:
> {code:java}
> org.apache.geode.session.tests.Jetty9CachingClientServerTest > 
> shouldCacheSessionOnClient FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in VM 1 running 
> on Host e4bd15534025 with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:434)
> at 
> org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:102)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.shouldCacheSessionOnClient(Jetty9CachingClientServerTest.java:57)
> Caused by:
> java.lang.IllegalStateException: Session is invalid
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.checkValid(GemfireHttpSession.java:317)
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.setAttribute(GemfireHttpSession.java:301)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$null$0(Jetty9CachingClientServerTest.java:60)
> at java.lang.Iterable.forEach(Iterable.java:75)
> at 
> java.util.Collections$UnmodifiableCollection.forEach(Collections.java:1085)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$shouldCacheSessionOnClient$6b5e8371$1(Jetty9CachingClientServerTest.java:60)org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > shouldNotLeaveNativeSessionInContainer FAILED
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldReplicateCookies FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> 
> java.util.concurrent.TimeoutExceptionorg.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldShareSessionExpirationReset FAILED
> org.junit.ComparisonFailure: 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldExpireInSetTimeframe FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldHavePersistentSessionData FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > failureShouldStillAllowOtherContainersDataAccess FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > invalidationShouldRemoveValueAccessForAllContainers FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in

[jira] [Commented] (GEODE-8664) JGroups exception is not propagated

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1720#comment-1720
 ] 

ASF subversion and git services commented on GEODE-8664:


Commit f23e01a32213245dc684b417d97a14cf5a026741 in geode's branch 
refs/heads/feature/GEODE-8444 from Mario Salazar de Torres
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=f23e01a ]

Revert "GEODE-8664: Nest errors in DistributionImpl.start (#5725)" (#5742)

This reverts commit f6605e0820ba9b858b8128cd31a03a29067b7710.

> JGroups exception is not propagated
> ---
>
> Key: GEODE-8664
> URL: https://issues.apache.org/jira/browse/GEODE-8664
> Project: Geode
>  Issue Type: Bug
>  Components: membership, messaging
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Minor
>  Labels: pull-request-available
> Attachments: GEODE-8664.patch
>
>
> *AS A* geode contributor
> *I WANT* exceptions thrown in DistributionImpl.start to be propagated
> *SO THAT* more information is provided while tackling issues.
>  
> 
> *Additional information:*
> After looking at an issue while starting a locator having to do with 
> JGroupMessenger I noticed that exceptions from Membership.start are not 
> correctly propagated in DistributionImpl.start method.
> To ilustrate the problem I attach below the reported exception while starting 
> the locator:
>  
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.GemFireConfigException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:184)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3033)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.java:623)
> at 
> org.apache.geode.distributed.LocatorLauncher.main(LocatorLauncher.java:217){noformat}
>  
> Thing is with that kind of information there's no way to know why the problem 
> is happening, so the idea would be to propagate the exceptions, so it looks 
> more like this:
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.SystemConnectException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:186)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.

[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1724#comment-1724
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8444) When requester's RVV equals provider's rvvGC, should do fullGII

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1729#comment-1729
 ] 

ASF subversion and git services commented on GEODE-8444:


Commit 86d34d8f6c631580ce7d0e69ef6c9c3be8b7dfd0 in geode's branch 
refs/heads/feature/GEODE-8444 from zhouxh
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=86d34d8 ]

GEODE-8444: When requester's RVV equals provider's rvvGC, should do fullGII


> When requester's RVV equals provider's rvvGC, should do fullGII 
> 
>
> Key: GEODE-8444
> URL: https://issues.apache.org/jira/browse/GEODE-8444
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> In current design, when requester's RVV equals provider's rvvGC, we treated 
> as dominated. 
> But if the last operation at provider is a destroy and the tombstone is GCed 
> when requester is offline, the provider lost the information about this key 
> when GII. So we should do fullGII instead of deltaGII in this case. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1728#comment-1728
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 399b875b493060d4acff0d1b41bdb37401181439 in geode's branch 
refs/heads/feature/GEODE-8444 from Xiaojian Zhou
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=399b875 ]

GEODE-8706: getConnection should get readLock to sync with destroyCon… (#5750)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8664) JGroups exception is not propagated

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233345#comment-17233345
 ] 

ASF subversion and git services commented on GEODE-8664:


Commit f23e01a32213245dc684b417d97a14cf5a026741 in geode's branch 
refs/heads/feature/GEODE-8444 from Mario Salazar de Torres
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=f23e01a ]

Revert "GEODE-8664: Nest errors in DistributionImpl.start (#5725)" (#5742)

This reverts commit f6605e0820ba9b858b8128cd31a03a29067b7710.

> JGroups exception is not propagated
> ---
>
> Key: GEODE-8664
> URL: https://issues.apache.org/jira/browse/GEODE-8664
> Project: Geode
>  Issue Type: Bug
>  Components: membership, messaging
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Minor
>  Labels: pull-request-available
> Attachments: GEODE-8664.patch
>
>
> *AS A* geode contributor
> *I WANT* exceptions thrown in DistributionImpl.start to be propagated
> *SO THAT* more information is provided while tackling issues.
>  
> 
> *Additional information:*
> After looking at an issue while starting a locator having to do with 
> JGroupMessenger I noticed that exceptions from Membership.start are not 
> correctly propagated in DistributionImpl.start method.
> To ilustrate the problem I attach below the reported exception while starting 
> the locator:
>  
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.GemFireConfigException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:184)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3033)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.java:623)
> at 
> org.apache.geode.distributed.LocatorLauncher.main(LocatorLauncher.java:217){noformat}
>  
> Thing is with that kind of information there's no way to know why the problem 
> is happening, so the idea would be to propagate the exceptions, so it looks 
> more like this:
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.SystemConnectException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:186)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.

[jira] [Commented] (GEODE-8692) ArrayIndexOutOfBoundsException may be thrown in RegionAdvisor.processProfilesQueuedDuringInitialization

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233343#comment-17233343
 ] 

ASF subversion and git services commented on GEODE-8692:


Commit c99087aeb19abfb5bbd57036349870a6d784df1a in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=c99087a ]

GEODE-8692: ArrayIndexOutOfBoundsException may be thrown in 
RegionAdvisor.processProfilesQueuedDuringInitialization (#5722)



> ArrayIndexOutOfBoundsException may be thrown in 
> RegionAdvisor.processProfilesQueuedDuringInitialization
> ---
>
> Key: GEODE-8692
> URL: https://issues.apache.org/jira/browse/GEODE-8692
> Project: Geode
>  Issue Type: Bug
>Reporter: Sarah Abbey
>Assignee: Sarah Abbey
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> If {{RegionAdvisor.buckets == null}} at the time the value of {{serials}} is 
> generated, an {{ArrayIndexOutOfBoundsException}} may be thrown in 
> {{RegionAdvisor.processProfilesQueuedDuringInitialization}} if buckets have 
> been initialized at that point.
> Code where {{ArrayIndexOutOfBoundsException}} might be thrown in 
> {{RegionAdvisor.processProfilesQueuedDuringInitialization():}}
> {code:java}
> for (int i = 0; i < buckets.length; i++) {
>   BucketAdvisor ba = buckets[i].getBucketAdvisor();
>   int serial = qbp.serials[i]; <<< Exception thrown here
>   if (serial != ILLEGAL_SERIAL) {
> ba.removeIdWithSerial(qbp.memberId, serial, qbp.destroyed);
>   }
> }
> {code}
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8664) JGroups exception is not propagated

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8664?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233344#comment-17233344
 ] 

ASF subversion and git services commented on GEODE-8664:


Commit f6605e0820ba9b858b8128cd31a03a29067b7710 in geode's branch 
refs/heads/feature/GEODE-8444 from Mario Salazar de Torres
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=f6605e0 ]

GEODE-8664: Nest errors in DistributionImpl.start (#5725)

 - If while calling DistributionImpl.start there was either a
   MembershipConfigurationException or MemberStartupException exceptions, its
   original cause was not propagated, therefore being unable to properly
   tackle issues on startup.
 - This commit propagates both exceptions.
 - Also 2 junit test were added to make sure this is working.


> JGroups exception is not propagated
> ---
>
> Key: GEODE-8664
> URL: https://issues.apache.org/jira/browse/GEODE-8664
> Project: Geode
>  Issue Type: Bug
>  Components: membership, messaging
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Mario Salazar de Torres
>Assignee: Mario Salazar de Torres
>Priority: Minor
>  Labels: pull-request-available
> Attachments: GEODE-8664.patch
>
>
> *AS A* geode contributor
> *I WANT* exceptions thrown in DistributionImpl.start to be propagated
> *SO THAT* more information is provided while tackling issues.
>  
> 
> *Additional information:*
> After looking at an issue while starting a locator having to do with 
> JGroupMessenger I noticed that exceptions from Membership.start are not 
> correctly propagated in DistributionImpl.start method.
> To ilustrate the problem I attach below the reported exception while starting 
> the locator:
>  
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.GemFireConfigException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:184)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3033)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocator.java:743)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startLocator(InternalLocator.java:388)
> at 
> org.apache.geode.distributed.LocatorLauncher.start(LocatorLauncher.java:716)
> at org.apache.geode.distributed.LocatorLauncher.run(LocatorLauncher.java:623)
> at 
> org.apache.geode.distributed.LocatorLauncher.main(LocatorLauncher.java:217){noformat}
>  
> Thing is with that kind of information there's no way to know why the problem 
> is happening, so the idea would be to propagate the exceptions, so it looks 
> more like this:
> {noformat}
> locator is exiting due to an exception
> org.apache.geode.SystemConnectException: unable to create jgroups channel
> at 
> org.apache.geode.distributed.internal.DistributionImpl.start(DistributionImpl.java:186)
> at 
> org.apache.geode.distributed.internal.DistributionImpl.createDistribution(DistributionImpl.java:222)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:464)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.(ClusterDistributionManager.java:497)
> at 
> org.apache.geode.distributed.internal.ClusterDistributionManager.create(ClusterDistributionManager.java:326)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.initialize(InternalDistributedSystem.java:779)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.access$200(InternalDistributedSystem.java:135)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem$Builder.build(InternalDistributedSystem.java:3034)
> at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.connectInternal(InternalDistributedSystem.java:290)
> at 
> org.apache.geode.distributed.internal.InternalLocator.startDistributedSystem(InternalLocat

[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233348#comment-17233348
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8444) When requester's RVV equals provider's rvvGC, should do fullGII

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233354#comment-17233354
 ] 

ASF subversion and git services commented on GEODE-8444:


Commit 86d34d8f6c631580ce7d0e69ef6c9c3be8b7dfd0 in geode's branch 
refs/heads/feature/GEODE-8444 from zhouxh
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=86d34d8 ]

GEODE-8444: When requester's RVV equals provider's rvvGC, should do fullGII


> When requester's RVV equals provider's rvvGC, should do fullGII 
> 
>
> Key: GEODE-8444
> URL: https://issues.apache.org/jira/browse/GEODE-8444
> Project: Geode
>  Issue Type: Bug
>Reporter: Xiaojian Zhou
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
>
> In current design, when requester's RVV equals provider's rvvGC, we treated 
> as dominated. 
> But if the last operation at provider is a destroy and the tombstone is GCed 
> when requester is offline, the provider lost the information about this key 
> when GII. So we should do fullGII instead of deltaGII in this case. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233353#comment-17233353
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 399b875b493060d4acff0d1b41bdb37401181439 in geode's branch 
refs/heads/feature/GEODE-8444 from Xiaojian Zhou
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=399b875 ]

GEODE-8706: getConnection should get readLock to sync with destroyCon… (#5750)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8704) many CI failures in Jetty9CachingClientServerTest

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233346#comment-17233346
 ] 

ASF subversion and git services commented on GEODE-8704:


Commit 8f8cc7b62be9e7d3a20f18e38fe2cff3228a55bb in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=8f8cc7b ]

GEODE-8704: many CI failures in Jetty9CachingClientServerTest (#5745)



> many CI failures in Jetty9CachingClientServerTest
> -
>
> Key: GEODE-8704
> URL: https://issues.apache.org/jira/browse/GEODE-8704
> Project: Geode
>  Issue Type: Bug
>  Components: http session
>Affects Versions: 1.14.0
>Reporter: Kamilla Aslami
>Assignee: Benjamin P Ross
>Priority: Major
>  Labels: pull-request-available
>
> A bunch of failures:
> {code:java}
> org.apache.geode.session.tests.Jetty9CachingClientServerTest > 
> shouldCacheSessionOnClient FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in VM 1 running 
> on Host e4bd15534025 with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:434)
> at 
> org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:102)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.shouldCacheSessionOnClient(Jetty9CachingClientServerTest.java:57)
> Caused by:
> java.lang.IllegalStateException: Session is invalid
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.checkValid(GemfireHttpSession.java:317)
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.setAttribute(GemfireHttpSession.java:301)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$null$0(Jetty9CachingClientServerTest.java:60)
> at java.lang.Iterable.forEach(Iterable.java:75)
> at 
> java.util.Collections$UnmodifiableCollection.forEach(Collections.java:1085)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$shouldCacheSessionOnClient$6b5e8371$1(Jetty9CachingClientServerTest.java:60)org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > shouldNotLeaveNativeSessionInContainer FAILED
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldReplicateCookies FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> 
> java.util.concurrent.TimeoutExceptionorg.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldShareSessionExpirationReset FAILED
> org.junit.ComparisonFailure: 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldExpireInSetTimeframe FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldHavePersistentSessionData FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > failureShouldStillAllowOtherContainersDataAccess FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > invalidationShouldRemoveValueAccessForAllContainers FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in

[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233350#comment-17233350
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterrup

[jira] [Commented] (GEODE-8672) Concurrent transactional destroy with GII could cause an entry to be removed and version information to be lost

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233347#comment-17233347
 ] 

ASF subversion and git services commented on GEODE-8672:


Commit 7a55cba21cbe1fd22668c014a8a96ff161a75820 in geode's branch 
refs/heads/feature/GEODE-8444 from Eric Shu
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=7a55cba ]

GEODE-8672: No need in token mode if concurrencyChecksEnabled (#5746)

  * The DESTROYED token is only needed to prevent concurrent destroy op
is lost in GII. If concurrency checks are enabled, the version tag
should be able to prevent the destroy op being lost.
  * Correctly set the inTokenMode to avoid a potential hang in replicate
region as invokeTXCallbacks needs to wait for the region initialization
finished, but region intialization could not due to lock held by
threads invoking callbacks.


> Concurrent transactional destroy with GII could cause an entry to be removed 
> and version information to be lost
> ---
>
> Key: GEODE-8672
> URL: https://issues.apache.org/jira/browse/GEODE-8672
> Project: Geode
>  Issue Type: Bug
>  Components: regions
>Affects Versions: 1.1.0
>Reporter: Eric Shu
>Assignee: Eric Shu
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> In a newly rebalanced bucket, while GII is in progress, a transactional 
> destroy is applied to cache. There is a logic that it should be in token mode 
> and leaves the entry as a Destroyed token, even though the version tag of the 
> entry indicates that it has the correct version.
> However, at end of the GII, there is a 
> cleanUpDestroyedTokensAndMarkGIIComplete method removes all the destroyed 
> entries – this wipes off the entry version tag information and cause the 
> subsequent creates starts fresh with new version tags.
> This could leads to client server data inconsistency as the newly created 
> entries will be ignored by the clients as the newly created entry has lower 
> version number while client has high ones.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8706) testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure in CI

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233352#comment-17233352
 ] 

ASF subversion and git services commented on GEODE-8706:


Commit 1bf18b2e9d3bee8c75ec2acb2331a6c12cc886b6 in geode's branch 
refs/heads/feature/GEODE-8444 from John Hutchison
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=1bf18b2 ]

GEODE-8706: Redis INFO command 'Keyspace' section should not be present if no 
keys in database (#5753)



> testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode failure 
> in CI
> ---
>
> Key: GEODE-8706
> URL: https://issues.apache.org/jira/browse/GEODE-8706
> Project: Geode
>  Issue Type: Bug
>Reporter: John Hutchison
>Assignee: Xiaojian Zhou
>Priority: Major
>  Labels: GeodeOperationAPI, pull-request-available
> Fix For: 1.14.0
>
>
> __org.apache.geode.internal.cache.wan.offheap.SerialWANPropagationOffHeapDUnitTest
>  > testReplicatedSerialPropagationWithRemoteReceiverRestartedOnOtherNode 
> FAILED
>  org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.internal.cache.wan.serial.SerialWANPropagationDUnitTest$$Lambda$181/1328086690.run
>  in VM 4 running on Host 3e4042fb1cd7 with 8 VMs
>  Caused by:
>  org.awaitility.core.ConditionTimeoutException: Assertion condition defined 
> as a lambda expression in org.apache.geode.internal.cache.wan.WANTestBase 
> that uses java.util.Set, java.util.Setint Expected queue entries: 0 but 
> actual entries: 8000 expected:<0> but was:<8000> within 5 minutes.
>  Caused by:
>  java.lang.AssertionError: Expected queue entries: 0 but actual entries: 8000 
> expected:<0> but was:<8000>
>  java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
>  Fix the strings or use IgnoredException.addIgnoredException to ignore.
>  --
>  Found suspect string in 'dunit_suspect-vm4.log' at line 68
>  [fatal 2020/11/13 18:09:30.989 GMT  for GatewaySender_ln> tid=619] Stopping the processor because the following 
> exception occurred while processing a batch: 
>  java.lang.NullPointerException
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.getConnection(GatewaySenderEventRemoteDispatcher.java:299)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher.readAcknowledgement(GatewaySenderEventRemoteDispatcher.java:107)
>  at 
> org.apache.geode.internal.cache.wan.GatewaySenderEventRemoteDispatcher$AckReaderThread.run(GatewaySenderEventRemoteDispatcher.java:605)
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8697) Propagate ForcedDisconnectException to the user application in a network partition scenario

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233351#comment-17233351
 ] 

ASF subversion and git services commented on GEODE-8697:


Commit 403e19c0a2b85369274e8254c16e0ae508b82e94 in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=403e19c ]

GEODE-8697: Propagate ForcedDisconnectException to the user application (#5739)

* GEODE-8697: Propagate ForcedDisconnectException to the user application

avoid setting MemberDisconnectedException as a rootCause in
ClusterDistributionManager, even temporarily, as it's an internal
exception that should not be exposed to applications.

* addressing Ernie's comments

> Propagate ForcedDisconnectException to the user application in a network 
> partition scenario
> ---
>
> Key: GEODE-8697
> URL: https://issues.apache.org/jira/browse/GEODE-8697
> Project: Geode
>  Issue Type: Bug
>  Components: membership
>Affects Versions: 1.12.0, 1.13.0
>Reporter: Kamilla Aslami
>Assignee: Bruce J Schuchardt
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> During network partitioning, we expect that the coordinator closes its 
> cluster with a ForcedDisconnectException. However, in some cases, threads end 
> up with a MemberDisconnectedException.
> System logs show that a ForcedDisconnect has happened:
> {code:java}
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.membership.gms.GMSMembership$ManagerImpl.forceDisconnect(GMSMembership.java:2007)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.forceDisconnect(GMSJoinLeave.java:1085)
>  at 
> org.apache.geode.distributed.internal.membership.gms.membership.GMSJoinLeave.processMessage(GMSJoinLeave.java:1422)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1327)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.JGroupsMessenger$JGroupsReceiver.receive(JGroupsMessenger.java:1266)
>  at org.jgroups.JChannel.invokeCallback(JChannel.java:816)
>  at org.jgroups.JChannel.up(JChannel.java:741)
>  at org.jgroups.stack.ProtocolStack.up(ProtocolStack.java:1030)
>  at org.jgroups.protocols.FRAG2.up(FRAG2.java:165)
>  at org.jgroups.protocols.FlowControl.up(FlowControl.java:390)
>  at org.jgroups.protocols.UNICAST3.deliverMessage(UNICAST3.java:1077)
>  at org.jgroups.protocols.UNICAST3.handleDataReceived(UNICAST3.java:792)
>  at org.jgroups.protocols.UNICAST3.up(UNICAST3.java:433)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.StatRecorder.up(StatRecorder.java:72)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.AddressManager.up(AddressManager.java:70)
>  at org.jgroups.protocols.TP.passMessageUp(TP.java:1658)
>  at org.jgroups.protocols.TP$SingleMessageHandler.run(TP.java:1876)
>  at org.jgroups.util.DirectExecutor.execute(DirectExecutor.java:10)
>  at org.jgroups.protocols.TP.handleSingleMessage(TP.java:1789)
>  at org.jgroups.protocols.TP.receive(TP.java:1714)
>  at 
> org.apache.geode.distributed.internal.membership.gms.messenger.Transport.receive(Transport.java:159)
>  at org.jgroups.protocols.UDP$PacketReceiver.run(UDP.java:701)
>  at java.lang.Thread.run(Thread.java:748){code}
> But it is never propagated upwards to the user application:
> {code:java}
> org.apache.geode.distributed.DistributedSystemDisconnectedException: This 
> connection to a distributed system has been disconnected., caused by 
> org.apache.geode.distributed.internal.membership.api.MemberDisconnectedException:
>  Membership coordinator 
> 10.32.111.185(gemfire3_host1_7340:7340:locator):41000 has declared 
> that a network partition has occurred
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.checkConnected(InternalDistributedSystem.java:978)
>  at 
> org.apache.geode.distributed.internal.InternalDistributedSystem.getDistributionManager(InternalDistributedSystem.java:1679)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.getDistributionManager(ReplyProcessor21.java:366)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.postWait(ReplyProcessor21.java:600)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterruptibly(ReplyProcessor21.java:824)
>  at 
> org.apache.geode.distributed.internal.ReplyProcessor21.waitForRepliesUninterrup

[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233361#comment-17233361
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233362#comment-17233362
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8670) ReconnectDUnitTest is hiding a NullPointerException

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233349#comment-17233349
 ] 

ASF subversion and git services commented on GEODE-8670:


Commit 99e7a138064419c9a676411252e64077c2a1180f in geode's branch 
refs/heads/feature/GEODE-8444 from Bruce Schuchardt
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=99e7a13 ]

GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException (#5744)

* GEODE-8670: ReconnectDUnitTest is hiding a NullPointerException

1. Fixed all of the IgnorableException problems in ReconnectDUnitTest
2. Modified InternalLocator to avoid throwing a NullPointerException and
   added a test for this change

* fixing invalid test - had to move it to a new class to use SystemOutRule

the rule wasn't working for some reason when I added it to
InternalLocatorIntegrationTest

* removed use of SystemOutRule, which wasn't working in Stress tests

> ReconnectDUnitTest is hiding a NullPointerException
> ---
>
> Key: GEODE-8670
> URL: https://issues.apache.org/jira/browse/GEODE-8670
> Project: Geode
>  Issue Type: Test
>  Components: membership, tests
>Affects Versions: 1.14.0
>Reporter: Jens Deppe
>Assignee: Bruce J Schuchardt
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.14.0
>
>
> Since working on GEODE-8609, I found that the regex in ReconnectDUnitTest is 
> incorrect. Specifically it looks like this:
> {noformat}
> IgnoredException.addIgnoredException("ForcedDisconnectException||Possible 
> loss of quorum");
> {noformat}
> When removing the double pipe ({{||}}) various test methods fail with a 
> {{NullPointerException}}. For example:
> {noformat}
> java.lang.AssertionError: Suspicious strings were written to the log during 
> this run.
> Fix the strings or use IgnoredException.addIgnoredException to ignore.
> ---
> Found suspect string in 'dunit_suspect-vm1.log' at line 692
> [fatal 2020/10/29 05:59:26.027 PDT  
> tid=203] Uncaught exception in thread Thread[Location services restart 
> thread,5,RMI Runtime]
> java.lang.NullPointerException
>   at 
> org.apache.geode.distributed.internal.InternalLocator.attemptReconnect(InternalLocator.java:1108)
>   at 
> org.apache.geode.distributed.internal.InternalLocator.lambda$launchRestartThread$4(InternalLocator.java:1070)
>   at java.lang.Thread.run(Thread.java:748)
> {noformat}
> There is also an additional {{DistributedSystemDisconnectedException}} which 
> probably just requires an addition to the other ignored exceptions.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (GEODE-8704) many CI failures in Jetty9CachingClientServerTest

2020-11-16 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/GEODE-8704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17233358#comment-17233358
 ] 

ASF subversion and git services commented on GEODE-8704:


Commit 8f8cc7b62be9e7d3a20f18e38fe2cff3228a55bb in geode's branch 
refs/heads/feature/GEODE-8444 from Sarah
[ https://gitbox.apache.org/repos/asf?p=geode.git;h=8f8cc7b ]

GEODE-8704: many CI failures in Jetty9CachingClientServerTest (#5745)



> many CI failures in Jetty9CachingClientServerTest
> -
>
> Key: GEODE-8704
> URL: https://issues.apache.org/jira/browse/GEODE-8704
> Project: Geode
>  Issue Type: Bug
>  Components: http session
>Affects Versions: 1.14.0
>Reporter: Kamilla Aslami
>Assignee: Benjamin P Ross
>Priority: Major
>  Labels: pull-request-available
>
> A bunch of failures:
> {code:java}
> org.apache.geode.session.tests.Jetty9CachingClientServerTest > 
> shouldCacheSessionOnClient FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in VM 1 running 
> on Host e4bd15534025 with 4 VMs
> at org.apache.geode.test.dunit.VM.executeMethodOnObject(VM.java:623)
> at org.apache.geode.test.dunit.VM.invoke(VM.java:434)
> at 
> org.apache.geode.test.junit.rules.VMProvider.invoke(VMProvider.java:102)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.shouldCacheSessionOnClient(Jetty9CachingClientServerTest.java:57)
> Caused by:
> java.lang.IllegalStateException: Session is invalid
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.checkValid(GemfireHttpSession.java:317)
> at 
> org.apache.geode.modules.session.internal.filter.GemfireHttpSession.setAttribute(GemfireHttpSession.java:301)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$null$0(Jetty9CachingClientServerTest.java:60)
> at java.lang.Iterable.forEach(Iterable.java:75)
> at 
> java.util.Collections$UnmodifiableCollection.forEach(Collections.java:1085)
> at 
> org.apache.geode.session.tests.Jetty9CachingClientServerTest.lambda$shouldCacheSessionOnClient$6b5e8371$1(Jetty9CachingClientServerTest.java:60)org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > shouldNotLeaveNativeSessionInContainer FAILED
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldReplicateCookies FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> 
> java.util.concurrent.TimeoutExceptionorg.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldShareSessionExpirationReset FAILED
> org.junit.ComparisonFailure: 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldExpireInSetTimeframe FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > containersShouldHavePersistentSessionData FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > failureShouldStillAllowOtherContainersDataAccess FAILED
> org.awaitility.core.ConditionTimeoutException: Assertion condition 
> defined as a lambda expression in 
> org.apache.geode.session.tests.CargoTestBase Sessions are not replicating 
> properly expected: but 
> was: within 5 minutes.
> Caused by:
> org.junit.ComparisonFailure: Sessions are not replicating properly 
> expected: but 
> was:org.apache.geode.session.tests.Jetty9CachingClientServerTest
>  > invalidationShouldRemoveValueAccessForAllContainers FAILED
> org.apache.geode.test.dunit.RMIException: While invoking 
> org.apache.geode.test.dunit.internal.IdentifiableRunnable.run in

  1   2   >