[ 
https://issues.apache.org/jira/browse/HADOOP-10090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13820421#comment-13820421
 ] 

Luke Lu commented on HADOOP-10090:
----------------------------------

I was aware of the suboptimal behavior and hoped it'd be OK for metrics, which 
don't require strong consistency.

#1 incurs unnecessary overhead (updating the JMX cache) for people who don't 
use JMX. This is the reason for the current cache logic.
#2 is risky, as we don't know all existing JMX query patterns (especially due 
to HDFS-5333). Users (admins) already have the option of setting a small JMX 
cache TTL for freshness.

How about #3: we initialize and start updating the JMX cache only when JMX is 
first used, stop updating after a period of inactivity, and reinitialize and 
resume updating upon new activity. Initialize/reinitialize is a "dense" 
update, while "update" means the "sparse" update with the current lastRecs 
mechanism. I think #3 should be a fairly straightforward patch and more 
flexible than #1 and #2.
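
A rough sketch of what #3 could look like, to make the dense/sparse 
distinction concrete. This is not the actual metrics2 code: LazyJmxCache, 
onSnapshot, get and idleTimeoutMs are hypothetical names, and metric values 
are simplified to longs keyed by name.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical illustration of option #3; not the Hadoop metrics2 API.
class LazyJmxCache {
    private final long idleTimeoutMs;
    private final Map<String, Long> cache = new HashMap<>();
    private boolean active = false;   // false until JMX is first used
    private long lastAccessMs;

    LazyJmxCache(long idleTimeoutMs) {
        this.idleTimeoutMs = idleTimeoutMs;
    }

    // Called on every metrics snapshot with only the records that changed
    // since the previous snapshot (the "sparse" update).
    synchronized void onSnapshot(Map<String, Long> changed, long nowMs) {
        if (!active) {
            return;                   // nobody uses JMX: no cache overhead
        }
        if (nowMs - lastAccessMs > idleTimeoutMs) {
            active = false;           // idle for too long: stop updating
            cache.clear();
            return;
        }
        cache.putAll(changed);
    }

    // Called on a JMX read. fullSnapshot supplies all current values and is
    // only invoked for a "dense" (re)initialization.
    synchronized Long get(String name, Supplier<Map<String, Long>> fullSnapshot,
                          long nowMs) {
        if (!active || nowMs - lastAccessMs > idleTimeoutMs) {
            cache.clear();
            cache.putAll(fullSnapshot.get());
            active = true;
        }
        lastAccessMs = nowMs;
        return cache.get(name);
    }

    synchronized boolean isActive() {
        return active;
    }
}
```

Because every (re)initialization is dense, a reader never sees a cache that 
missed sparse updates while it was inactive, and clusters that never query 
JMX skip the cache work entirely.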


> Jobtracker metrics not updated properly after execution of a mapreduce job
> --------------------------------------------------------------------------
>
>                 Key: HADOOP-10090
>                 URL: https://issues.apache.org/jira/browse/HADOOP-10090
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: metrics
>    Affects Versions: 1.2.1
>            Reporter: Ivan Mitic
>            Assignee: Ivan Mitic
>         Attachments: OneBoxRepro.png
>
>
> After executing a wordcount mapreduce sample job, jobtracker metrics are not 
> updated properly. The response from the jobtracker often reports a higher 
> number of job_completed than job_submitted (for example, 8 jobs completed 
> and 7 jobs submitted). 
> Issue reported by Toma Paunovic.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
