[
https://issues.apache.org/jira/browse/HADOOP-10079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13811649#comment-13811649
]
Colin Patrick McCabe commented on HADOOP-10079:
-----------------------------------------------
bq. Can you comment on how you chose the default value of 30s?
I was basing this timeout on {{ha.health-monitor.rpc-timeout.ms}}, which is
currently 45000 (45 seconds). I'll change it to 5 seconds in the interest of
detecting more performance problems.
bq. Do we need that new startMs variable when we have already have now (which
could be renamed startMs? I don't think the cached check contributes
meaningfully compared to the 30s timeout.
true, we only need to call {{Time#monotonicNow}} once
bq. Also, while we're in this file, how about moving the hardcoded default
value of HADOOP_SECURITY_GROUPS_CACHE_SECS into CommonConfigurationKeys?
OK
bq. Since we're now measure group resolution time, should we perhaps put this
into a percentile metric as well?
I took a look at doing this, but it got complex. For one thing,
{{dfs.metrics.percentiles.intervals}} is currently HDFS-only, not in common.
For another, the way the {{Groups}} singleton is created doesn't take into
account the process name, which we would need to add the right metrics. Let's
open a follow-up JIRA for this.
> log a warning message if group resolution takes too long.
> ---------------------------------------------------------
>
> Key: HADOOP-10079
> URL: https://issues.apache.org/jira/browse/HADOOP-10079
> Project: Hadoop Common
> Issue Type: Improvement
> Affects Versions: 2.2.0
> Reporter: Colin Patrick McCabe
> Assignee: Colin Patrick McCabe
> Attachments: HADOOP-10079.001.patch, HADOOP-10079.002.patch
>
>
> We should log a warning message if group resolution takes too long.
--
This message was sent by Atlassian JIRA
(v6.1#6144)