[ https://issues.apache.org/jira/browse/FLUME-3020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652423#comment-15652423 ]
ASF subversion and git services commented on FLUME-3020:
--------------------------------------------------------
Commit 9868c4231362e5568a1675a604288d60cbadd7fe in flume's branch
refs/heads/trunk from [~malaskat]
[ https://git-wip-us.apache.org/repos/asf?p=flume.git;h=9868c42 ]
FLUME-3020. Improve HDFS Sink escape sequence substitution
When escape sequences are used, the current code calls InetAddress
for every event, which has a huge impact on performance.
This patch fixes that issue by caching the local host in a static variable.
We can do this because there is zero chance the local host will change during
the life of a JVM.
Reviewers: Hari Shreedharan, Jeff Holoman, Bessenyei Balázs Donát
(Theodore michael Malaska via Bessenyei Balázs Donát)
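The caching idea described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code from the patch: the class name `LocalHostCache`, the `expand` helper, the `lookups` counter, and the `"localhost"` fallback are all hypothetical, and the `%[localhost]` handling is a simplified stand-in for the sink's real escape substitution.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: resolve the local host once and reuse it for every event.
final class LocalHostCache {
    // Counts real InetAddress lookups so the effect of caching is observable.
    static final AtomicInteger lookups = new AtomicInteger();

    // Cached host name; written once, guarded by the class lock.
    private static String cachedHostName;

    static synchronized String hostName() {
        if (cachedHostName == null) {
            lookups.incrementAndGet();
            try {
                cachedHostName = InetAddress.getLocalHost().getHostName();
            } catch (UnknownHostException e) {
                // Assumption for this sketch only: fall back to a fixed name.
                cachedHostName = "localhost";
            }
        }
        return cachedHostName;
    }

    // Simplified stand-in for escape substitution on a path pattern.
    static String expand(String pattern) {
        return pattern.replace("%[localhost]", hostName());
    }

    public static void main(String[] args) {
        for (int i = 0; i < 100_000; i++) {
            expand("/flume/%[localhost]/events");
        }
        System.out.println("lookups=" + lookups.get()); // prints "lookups=1"
    }
}
```

Without the cache, each of the 100,000 calls would go through InetAddress; with it, only the first one does.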
> Improve HDFSEventSink Escape Ingestion by more than 10x by not getting
> InetAddress on every record
> --------------------------------------------------------------------------------------------------
>
> Key: FLUME-3020
> URL: https://issues.apache.org/jira/browse/FLUME-3020
> Project: Flume
> Issue Type: Improvement
> Reporter: Theodore michael Malaska
> Assignee: Theodore michael Malaska
> Fix For: v1.8.0
>
> Attachments: flume-3020.patch, flume-3020.patch.2
>
>
> If you use escape sequences, the current code will call InetAddress on every
> record, which results in a huge impact on performance.
> TotalTime,8403,
> totalEventTakeTime,1498,
> totalWriteTime,1981,
> totalWriterSetupTime,65,
> commitTime,201,
> flushTime,18,
> startTrans,7,
> The rest (8403 - 3770 = 4633, roughly 55% of TotalTime) is all InetAddress
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)