[
https://issues.apache.org/jira/browse/HADOOP-19394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18030434#comment-18030434
]
Ahmar Suhail commented on HADOOP-19394:
---------------------------------------
I've done some benchmarks against S3 Express and AAL works fine there. Since
it's just GETs, I don't expect any issues. I will try running the S3A tests
against S3 Express tomorrow.
> S3A Analytics Accelerator: vector IO support
> --------------------------------------------
>
> Key: HADOOP-19394
> URL: https://issues.apache.org/jira/browse/HADOOP-19394
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: fs/s3
> Affects Versions: 3.4.1
> Reporter: Steve Loughran
> Priority: Major
> Labels: pull-request-available
> Fix For: 3.5.0, 3.4.3
>
>
> Add vector IO support for analytics accelerator stream
> Three stages
> # pull up s3a input stream to work with all ObjectInputStreams; do its own
> fetching independent of the analytics stream
> # provide info to stream of fetches having taken place (remove from cache,
> cancel prefetch)
> # full integration
> * return a range from cache if present
> * append to the block retrieval callback if a prefetch is in progress
> * only do merge + new request if the range cannot be satisfied entirely from
> cached data
> Out of scope: handling case where part of a range is in cache/retrieval. Too
> complicated and so prone to problems.
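
The "full integration" stage quoted above can be read as a three-way decision
per requested range. The following is only a rough sketch of that decision,
not the S3A or AAL implementation: the class VectorReadPlanner, its methods,
and the two block indexes are all hypothetical names invented for
illustration. It assumes cached and in-flight blocks are tracked by start
offset, and it treats a range that is only partly covered as a new request,
in line with the "out of scope" note.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch; none of these names come from the S3A or AAL code. */
final class VectorReadPlanner {

    /** How a requested range would be served. */
    enum Source { CACHE, PENDING_PREFETCH, NEW_REQUEST }

    /** A byte range covering [offset, offset + length). */
    static final class Range {
        final long offset;
        final int length;

        Range(long offset, int length) {
            this.offset = offset;
            this.length = length;
        }

        long end() {
            return offset + length;
        }
    }

    /** The planned action for one requested range. */
    static final class Plan {
        final Range range;
        final Source source;

        Plan(Range range, Source source) {
            this.range = range;
            this.source = source;
        }
    }

    // Assumed bookkeeping: blocks keyed by their start offset.
    private final TreeMap<Long, Range> cached = new TreeMap<>();
    private final TreeMap<Long, Range> prefetching = new TreeMap<>();

    void addCached(long offset, int length) {
        cached.put(offset, new Range(offset, length));
    }

    void addPrefetching(long offset, int length) {
        prefetching.put(offset, new Range(offset, length));
    }

    /** True only if a single indexed block fully contains the range. */
    private static boolean covers(TreeMap<Long, Range> index, Range r) {
        Map.Entry<Long, Range> e = index.floorEntry(r.offset);
        return e != null && e.getValue().end() >= r.end();
    }

    /** Classify each range; partial overlaps fall through to NEW_REQUEST. */
    List<Plan> plan(List<Range> requested) {
        List<Plan> plans = new ArrayList<>();
        for (Range r : requested) {
            if (covers(cached, r)) {
                plans.add(new Plan(r, Source.CACHE));
            } else if (covers(prefetching, r)) {
                plans.add(new Plan(r, Source.PENDING_PREFETCH));
            } else {
                plans.add(new Plan(r, Source.NEW_REQUEST));
            }
        }
        return plans;
    }
}
```

In a real stream, the ranges classified as NEW_REQUEST would presumably be
the ones handed on to the merge step before any GET is issued. The sketch
stops at classification and does no IO.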
--
This message was sent by Atlassian Jira
(v8.20.10#820010)