[
https://issues.apache.org/jira/browse/HADOOP-19596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18012187#comment-18012187
]
Steve Loughran commented on HADOOP-19596:
-----------------------------------------
oh, and a PR for avro to say "avro file"
https://github.com/apache/avro/pull/1807
aggressive prefetching on files which say they will be read with random IO is
suboptimal
> ABFS: [ReadAheadV2] Increase Prefetch Aggressiveness to improve sequential
> read performance
> -------------------------------------------------------------------------------------------
>
> Key: HADOOP-19596
> URL: https://issues.apache.org/jira/browse/HADOOP-19596
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/azure
> Affects Versions: 3.5.0, 3.4.1
> Reporter: Anuj Modi
> Assignee: Anuj Modi
> Priority: Major
> Attachments: Read Buffer Manager V2.pdf
>
>
> Various analyses done in the past have shown a need for significant
> improvement in the performance of sequential reads. The current
> implementation clearly shows the lack of parallelism that is needed to cater
> to high throughput sequential read workloads.
> More details on updated design and results of POC benchmarking will be added
> here soon.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]