[ 
https://issues.apache.org/jira/browse/FLUME-3020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652423#comment-15652423
 ] 

ASF subversion and git services commented on FLUME-3020:
--------------------------------------------------------

Commit 9868c4231362e5568a1675a604288d60cbadd7fe in flume's branch 
refs/heads/trunk from [~malaskat]
[ https://git-wip-us.apache.org/repos/asf?p=flume.git;h=9868c42 ]

FLUME-3020. Improve HDFS Sink escape sequence substitution

When escape sequences are used, the current code calls InetAddress
for every event, which has a huge impact on performance.

This patch fixes that issue by caching the local host in a static variable.
We can do this because the local host is not expected to change during the
life of a JVM.
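
As a rough illustration of the caching idea described above (not the code
from the actual patch), the lookup can be moved into a static field so it
runs once per JVM instead of once per event. The class name LocalHostCache,
the substitute method, the <HOST> token, and the "localhost" fallback are
all hypothetical names chosen for this sketch.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical helper: resolves the local host name once and reuses it.
final class LocalHostCache {

    // Initialized once when the class is loaded; later calls only read it.
    private static final String CACHED_HOST_NAME = resolveHostName();

    private LocalHostCache() {
    }

    private static String resolveHostName() {
        try {
            // The expensive call that should not be repeated per event.
            return InetAddress.getLocalHost().getHostName();
        } catch (UnknownHostException e) {
            // Assumption for this sketch: fall back to a fixed name.
            return "localhost";
        }
    }

    static String hostName() {
        return CACHED_HOST_NAME;
    }

    // Replaces the illustrative <HOST> token with the cached host name.
    static String substitute(String template) {
        return template.replace("<HOST>", CACHED_HOST_NAME);
    }
}
```

With a helper like this, a per-event call such as
LocalHostCache.substitute("/flume/<HOST>/events") only performs a string
replacement and never goes back to InetAddress.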

Reviewers: Hari Shreedharan, Jeff Holoman, Bessenyei Balázs Donát

(Theodore michael Malaska via Bessenyei Balázs Donát)


> Improve HDFSEventSink Escape Ingestion by more than 10x by not getting 
> InetAddress on every record
> --------------------------------------------------------------------------------------------------
>
>                 Key: FLUME-3020
>                 URL: https://issues.apache.org/jira/browse/FLUME-3020
>             Project: Flume
>          Issue Type: Improvement
>            Reporter: Theodore michael Malaska
>            Assignee: Theodore michael Malaska
>             Fix For: v1.8.0
>
>         Attachments: flume-3020.patch, flume-3020.patch.2
>
>
> If you are using escape sequences, the current code will call InetAddress
> on every record, which has a huge impact on performance.
> TotalTime,8403,
> totalEventTakeTime,1498,
> totalWriteTime,1981,
> totalWriterSetupTime,65,
> commitTime,201,
> flushTime,18,
> startTrans,7,
> The rest is all InetAddress



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
