Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2025-02-17 Thread via GitHub
github-actions[bot] commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2664272130 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2025-02-03 Thread via GitHub
dsmiley commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2632403618 By the way, the Apache Solr project has an impressive "[BlockCache](https://github.com/apache/solr/blob/07943e87fb762b69a66932f777d56eb14cc72e78/solr/modules/hdfs/src/java/org/apache/solr

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2025-02-03 Thread via GitHub
dsmiley commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2632388607 Couldn't S3 and other file storage be implemented as an NIO FileSystem instead? AKA JSR-203. Would the Lucene Directory abstraction level have certain advantages (what)? Ideally we'd

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-12-07 Thread via GitHub
github-actions[bot] commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2525356998 This PR has not had activity in the past 2 weeks, labeling it as stale. If the PR is waiting for review, notify the d...@lucene.apache.org list. Thank you for your contributi

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-11-23 Thread via GitHub
albogdano commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2495524936 I refactored the code and got rid of the AWS SDK in favor of the minimal S3 Client [aws-lightweight-client-java](https://github.com/davidmoten/aws-lightweight-client-java). Also,

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-11-13 Thread via GitHub
jpountz commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2472787711 Since we have a S3 Directory implemented already, let's run a comparison with the fuse mount approach? -- This is an automated message from the Apache Git Service. To respond to the me

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-11-04 Thread via GitHub
reta commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2454758505 Just to share similar developments, we did S3 filesystem plugin store prototype for OpenSearch based on the same library [1], it works but the latency wise, it is not great (obviously, no c

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-11-04 Thread via GitHub
albogdano commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2454576266 @rmuir @josefschiefer27 I haven't yet tried experimenting with S3 fuse mounts but I doubt the performance will be any better than using the S3 API directly. Also, there are some specif

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-11-02 Thread via GitHub
josefschiefer27 commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2453047850 Amazon S3 also offers its own solution for mounting S3 buckets, available at https://github.com/awslabs/mountpoint-s3/. There are benchmarks at https://github.com/awslabs/

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-11-02 Thread via GitHub
rmuir commented on PR #13949: URL: https://github.com/apache/lucene/pull/13949#issuecomment-2453024865 Have we explored trying a fuse-based mount instead of writing java code? There seems to be several potentially viable approaches there: https://github.com/s3fs-fuse/s3fs-fuse, but ma

Re: [PR] Add new Directory implementation for AWS S3 [lucene]

2024-10-24 Thread via GitHub
jpountz commented on code in PR #13949: URL: https://github.com/apache/lucene/pull/13949#discussion_r1815193079 ## lucene/s3directory/build.gradle: ## @@ -0,0 +1,38 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements.

[PR] Add new Directory implementation for AWS S3 [lucene]

2024-10-23 Thread via GitHub
albogdano opened a new pull request, #13949: URL: https://github.com/apache/lucene/pull/13949 ### Description This PR adds a new module `s3directory` to Lucene, containing a new `Directory` implementation for AWS S3. The code was adapted from the [lucene-s3-directory](https://gith