patrick white created HADOOP-10554:
--------------------------------------
Summary: Performance: Scan metrics for 2.4 are notably down
compared to 0.23.9
Key: HADOOP-10554
URL: https://issues.apache.org/jira/browse/HADOOP-10554
Project: Hadoop Common
Issue Type: Bug
Affects Versions: 2.4.0
Reporter: patrick white
Performance comparison benchmarks for Scan test's runtime and throughput
metrics are slightly out of 5% tolerance in 2.x compared against 0.23. The
trend is consistent across later releases in both lines, latest release numbers
are;
Runtime:
2.4.0.0 -> 73.6 seconds (avg 5 passes)
0.23.9.12 -> 69.4 seconds (avg 5 passes)
Diff: -5.7%
Throughput:
2.4.0.0 -> 28.67 GB/s (avg 5 passes)
0.23.9.12 -> 30.41 GB/s (avg 5 passes)
Diff: -6.1%
Scan test is specifically measuring the average map's input read performance.
The diff is consistent when run on a larger (350 node) perf environment, we are
in process of seeing if this reproduces in a smaller cluster, using
appropriately scaled inputs.
--
This message was sent by Atlassian JIRA
(v6.2#6252)