Re: [PR] Spec v3: Add deletion vectors to the table spec [iceberg]

2024-10-14 Thread via GitHub
szehon-ho commented on code in PR #11240: URL: https://github.com/apache/iceberg/pull/11240#discussion_r1800307665 ## format/spec.md: ## @@ -841,19 +855,45 @@ Notes: ## Delete Formats -This section details how to encode row-level deletes in Iceberg delete files. Row-level

Re: [PR] Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplicate data/delete file detection [iceberg]

2024-10-14 Thread via GitHub
nastra commented on code in PR #11254: URL: https://github.com/apache/iceberg/pull/11254#discussion_r1800558085 ## core/src/main/java/org/apache/iceberg/MergingSnapshotProducer.java: ## @@ -82,11 +82,9 @@ abstract class MergingSnapshotProducer extends SnapshotProducer { priv

Re: [PR] Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplicate data/delete file detection [iceberg]

2024-10-14 Thread via GitHub
nastra commented on code in PR #11254: URL: https://github.com/apache/iceberg/pull/11254#discussion_r1800552749 ## core/src/main/java/org/apache/iceberg/MergingSnapshotProducer.java: ## @@ -1005,7 +1002,8 @@ private List newDeleteFilesAsManifests() { newDeleteFilesBySpec.

Re: [PR] Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplicate data/delete file detection [iceberg]

2024-10-14 Thread via GitHub
nastra commented on code in PR #11254: URL: https://github.com/apache/iceberg/pull/11254#discussion_r1800552495 ## core/src/main/java/org/apache/iceberg/MergingSnapshotProducer.java: ## @@ -974,7 +970,8 @@ private List newDataFilesAsManifests() { newDataFilesBySpec.forEac

Re: [PR] Core: Rename DeleteFileHolder to PendingDeleteFile / Optimize duplicate data/delete file detection [iceberg]

2024-10-14 Thread via GitHub
nastra commented on code in PR #11254: URL: https://github.com/apache/iceberg/pull/11254#discussion_r1800550234 ## core/src/main/java/org/apache/iceberg/SnapshotProducer.java: ## @@ -772,17 +773,139 @@ protected static class DeleteFileHolder { * * @param deleteFile d

Re: [PR] Flink: Add IcebergSinkBuilder interface allowed unification of most of operations on FlinkSink and IcebergSink Builders [iceberg]

2024-10-14 Thread via GitHub
arkadius commented on PR #11305: URL: https://github.com/apache/iceberg/pull/11305#issuecomment-2413009425 > > > @arkadius please take a look as the CI is broken > > > > > > Do you have an option to retry this build stage? It is rather impossible that extraction of an interface co

Re: [PR] Core: Switch usage to DataFileSet / DeleteFileSet [iceberg]

2024-10-14 Thread via GitHub
nastra commented on code in PR #11158: URL: https://github.com/apache/iceberg/pull/11158#discussion_r1800532477 ## hive-metastore/src/test/java/org/apache/iceberg/hive/HiveTableTest.java: ## @@ -213,7 +213,7 @@ public void testDropTable() throws IOException { table.newAppen

Re: [PR] (AWS) Docs: List all AWS S3 properties from all language impl. [iceberg]

2024-10-14 Thread via GitHub
hsiang-c commented on PR #11321: URL: https://github.com/apache/iceberg/pull/11321#issuecomment-2412945006 cc @Fokko @Xuanwo for reviews, thanks. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

[PR] (AWS) Docs: List all AWS S3 properties from all language impl. [iceberg]

2024-10-14 Thread via GitHub
hsiang-c opened a new pull request, #11321: URL: https://github.com/apache/iceberg/pull/11321 ### Note to reviewers - Closes https://github.com/apache/iceberg/issues/10674 - I moved S3 properties to its own doc (`aws-s3-fileio-properties.md`) and link to it from the original `aws.md

Re: [PR] Add Snowflake catalog [iceberg-python]

2024-10-14 Thread via GitHub
prabodh1194 closed pull request #687: Add Snowflake catalog URL: https://github.com/apache/iceberg-python/pull/687 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscrib

Re: [PR] Add Snowflake catalog [iceberg-python]

2024-10-14 Thread via GitHub
prabodh1194 commented on PR #687: URL: https://github.com/apache/iceberg-python/pull/687#issuecomment-2412914581 closing as I won't be able to continue with this PR now. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [PR] Revert "feat: Add equality delete writer (#372)" [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo commented on PR #672: URL: https://github.com/apache/iceberg-rust/pull/672#issuecomment-2412892346 Hi, @kevinjqliu, could you take a look? Thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [PR] feat: Add equality delete writer [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo commented on PR #372: URL: https://github.com/apache/iceberg-rust/pull/372#issuecomment-2412889476 Hi, I'm sorry, but I need to revert this PR. @Dysprosium0626, could you reopen and rebase your original PR and test it again? -- This is an automated message from the Apache Git Servi

Re: [PR] chore: Fix build after merge [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo closed pull request #670: chore: Fix build after merge URL: https://github.com/apache/iceberg-rust/pull/670 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscrib

[PR] Revert "feat: Add equality delete writer (#372)" [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo opened a new pull request, #672: URL: https://github.com/apache/iceberg-rust/pull/672 This reverts commit ad89eac02712ceac2c3cff6bf0fe5d1b6e289a26. I have to revert PR #372 since it can't pass the unit tests and I didn't find a quick way to fix it. -- This is an automated me

Re: [PR] chore: Fix build after merge [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo commented on PR #670: URL: https://github.com/apache/iceberg-rust/pull/670#issuecomment-2412887881 Replaced by https://github.com/apache/iceberg-rust/pull/672 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] Spec: Support geo type [iceberg]

2024-10-14 Thread via GitHub
paleolimbot commented on code in PR #10981: URL: https://github.com/apache/iceberg/pull/10981#discussion_r1800414577 ## format/spec.md: ## @@ -483,6 +485,8 @@ Notes: 2. For `float` and `double`, the value `-0.0` must precede `+0.0`, as in the IEEE 754 `totalOrder` predicate. N

Re: [I] Implement the equality delete writer [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo closed issue #341: Implement the equality delete writer URL: https://github.com/apache/iceberg-rust/issues/341 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubsc

Re: [PR] feat: Add equality delete writer [iceberg-rust]

2024-10-14 Thread via GitHub
Xuanwo merged PR #372: URL: https://github.com/apache/iceberg-rust/pull/372 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.a

Re: [PR] Puffin: Add delete-vector-v1 blob type [iceberg]

2024-10-14 Thread via GitHub
advancedxy commented on code in PR #11238: URL: https://github.com/apache/iceberg/pull/11238#discussion_r1800356102 ## format/puffin-spec.md: ## @@ -123,6 +123,54 @@ The blob metadata for this blob may include following properties: - `ndv`: estimate of number of distinct val

[I] Manifest List/Entry Creation [iceberg-go]

2024-10-14 Thread via GitHub
dwilson1988 opened a new issue, #172: URL: https://github.com/apache/iceberg-go/issues/172 ### Feature Request / Improvement Hello, I'm working on a use case where I need to be my own catalog and need to be able to create my own Iceberg tables purely in Go. I understand that table cr

Re: [PR] Spec: Fix table of content generation [iceberg]

2024-10-14 Thread via GitHub
danielcweeks commented on code in PR #11067: URL: https://github.com/apache/iceberg/pull/11067#discussion_r1800321978 ## format/spec.md: ## @@ -158,27 +158,27 @@ Readers should be more permissive because v1 metadata files are allowed in v2 ta Readers may be more strict for m

Re: [PR] Spec: Fix table of content generation [iceberg]

2024-10-14 Thread via GitHub
ajantha-bhat commented on PR #11067: URL: https://github.com/apache/iceberg/pull/11067#issuecomment-2412652792 New TOC with this change https://github.com/user-attachments/assets/e469de32-a608-4277-8dda-63c40b0fe0e9";> -- This is an automated message from the Apache Git Service.

Re: [PR] Spec: Fix table of content generation [iceberg]

2024-10-14 Thread via GitHub
ajantha-bhat commented on code in PR #11067: URL: https://github.com/apache/iceberg/pull/11067#discussion_r1800310393 ## format/spec.md: ## @@ -121,9 +121,9 @@ Tables do not require random-access writes. Once written, data and metadata file Tables do not require rename, except

Re: [PR] Spec v3: Add deletion vectors to the table spec [iceberg]

2024-10-14 Thread via GitHub
szehon-ho commented on code in PR #11240: URL: https://github.com/apache/iceberg/pull/11240#discussion_r1800282377 ## format/spec.md: ## @@ -454,35 +457,40 @@ The schema of a manifest file is a struct called `manifest_entry` with the follo `data_file` is a struct with the fo

Re: [I] Implement remaining operations for Glue catalog [iceberg-go]

2024-10-14 Thread via GitHub
vivekkoya commented on issue #64: URL: https://github.com/apache/iceberg-go/issues/64#issuecomment-2412610788 Hello, I can take this task. How can I get started? Can you please direct me to the relevant files and directories? Thanks for the help -- This is an automated message f

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800283158 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -325,7 +341,15 @@ private ManifestFile filterManifest(Schema tableSchema, Manifes

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800279697 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -325,7 +341,15 @@ private ManifestFile filterManifest(Schema tableSchema, Manifes

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800279697 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -325,7 +341,15 @@ private ManifestFile filterManifest(Schema tableSchema, Manifes

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800279697 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -325,7 +341,15 @@ private ManifestFile filterManifest(Schema tableSchema, Manifes

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800279697 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -325,7 +341,15 @@ private ManifestFile filterManifest(Schema tableSchema, Manifes

Re: [I] PyIceberg Production Use case survey [iceberg-python]

2024-10-14 Thread via GitHub
mariotaddeucci commented on issue #1202: URL: https://github.com/apache/iceberg-python/issues/1202#issuecomment-2412604690 Hey, actually I'm using in production for small datasets in combination with duckdb specially to avoid small files with webscrapping. For ingestion, reading many

Re: [PR] Puffin: Add delete-vector-v1 blob type [iceberg]

2024-10-14 Thread via GitHub
emkornfield commented on code in PR #11238: URL: https://github.com/apache/iceberg/pull/11238#discussion_r1800277755 ## format/puffin-spec.md: ## @@ -123,6 +123,54 @@ The blob metadata for this blob may include following properties: - `ndv`: estimate of number of distinct va

Re: [PR] Puffin: Add delete-vector-v1 blob type [iceberg]

2024-10-14 Thread via GitHub
emkornfield commented on code in PR #11238: URL: https://github.com/apache/iceberg/pull/11238#discussion_r1799796613 ## format/puffin-spec.md: ## @@ -123,6 +123,54 @@ The blob metadata for this blob may include following properties: - `ndv`: estimate of number of distinct va

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800275372 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -370,14 +407,7 @@ private boolean canContainDeletedFiles(ManifestFile manifest) {

Re: [PR] Core: Optimize MergingSnapshotProducer to use referenced manifests to determine if manifest needs to be rewritten [iceberg]

2024-10-14 Thread via GitHub
amogh-jahagirdar commented on code in PR #11131: URL: https://github.com/apache/iceberg/pull/11131#discussion_r1800273374 ## core/src/main/java/org/apache/iceberg/ManifestFilterManager.java: ## @@ -78,9 +78,11 @@ public String partition() { private boolean failMissingDeletePa

Re: [PR] Puffin: Add delete-vector-v1 blob type [iceberg]

2024-10-14 Thread via GitHub
emkornfield commented on code in PR #11238: URL: https://github.com/apache/iceberg/pull/11238#discussion_r1799766917 ## format/puffin-spec.md: ## @@ -123,6 +123,54 @@ The blob metadata for this blob may include following properties: - `ndv`: estimate of number of distinct va

Re: [PR] Rest Catalog: Add RESTful AppendFiles data operation [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9292: Rest Catalog: Add RESTful AppendFiles data operation URL: https://github.com/apache/iceberg/pull/9292 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to t

Re: [PR] Rest Catalog: Add RESTful AppendFiles data operation [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9292: URL: https://github.com/apache/iceberg/pull/9292#issuecomment-2412571003 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Add support for CreateScan and GetScanTasks in RESTCatalog [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9252: Add support for CreateScan and GetScanTasks in RESTCatalog URL: https://github.com/apache/iceberg/pull/9252 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

Re: [PR] Add support for CreateScan and GetScanTasks in RESTCatalog [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9252: URL: https://github.com/apache/iceberg/pull/9252#issuecomment-2412570958 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Spark: IN clause on system function is not pushed down [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9192: Spark: IN clause on system function is not pushed down URL: https://github.com/apache/iceberg/pull/9192 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [PR] Core: Suppress exceptions in case of dropTableData [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9184: URL: https://github.com/apache/iceberg/pull/9184#issuecomment-2412570820 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Parquet: Add a table property to control the Parquet row-group size of position delete files [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9177: URL: https://github.com/apache/iceberg/pull/9177#issuecomment-2412570798 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [I] Merge into second commit when with no changes [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #9024: Merge into second commit when with no changes URL: https://github.com/apache/iceberg/issues/9024 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [I] Flink SQL SELECT ORDER BY clause caused data loss. [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #9022: Flink SQL SELECT ORDER BY clause caused data loss. URL: https://github.com/apache/iceberg/issues/9022 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spec

Re: [I] Creating a hive Managed Table? [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #9013: URL: https://github.com/apache/iceberg/issues/9013#issuecomment-2412570660 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] [Feature Request] Implement `equals` for `RESTMessage` [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #9003: [Feature Request] Implement `equals` for `RESTMessage` URL: https://github.com/apache/iceberg/issues/9003 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] Data duplicate after the partition is modified [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8979: URL: https://github.com/apache/iceberg/issues/8979#issuecomment-2412570619 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] org.apache.iceberg.spark.source.SerializableTableWithSize cannot be cast to org.apache.iceberg.Table [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8978: URL: https://github.com/apache/iceberg/issues/8978#issuecomment-2412570596 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] hive integration iceberg related problems [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8993: URL: https://github.com/apache/iceberg/issues/8993#issuecomment-2412570639 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] hive integration iceberg related problems [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8993: hive integration iceberg related problems URL: https://github.com/apache/iceberg/issues/8993 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

Re: [I] View is no longer in sync with table after catalog cache entry expires [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8977: URL: https://github.com/apache/iceberg/issues/8977#issuecomment-2412570575 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Support MOR CDC view [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8975: URL: https://github.com/apache/iceberg/issues/8975#issuecomment-2412570554 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Support adding an additional `opType` column when creating a table [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8973: URL: https://github.com/apache/iceberg/issues/8973#issuecomment-2412570494 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Support adding an additional `opType` column when creating a table [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8973: Support adding an additional `opType` column when creating a table URL: https://github.com/apache/iceberg/issues/8973 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above t

Re: [PR] fix when equalityFieldColumns is not null and upsert is false, position delete in write function will lead to unstable result if flink checkpoint interval is not same [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9300: fix when equalityFieldColumns is not null and upsert is false, position delete in write function will lead to unstable result if flink checkpoint interval is not same URL: https://github.com/apache/iceberg/pull/9300 -- This is an automated messa

Re: [I] Why are updateSchema and UpdatePartitionSpec commit not retried? [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8964: Why are updateSchema and UpdatePartitionSpec commit not retried? URL: https://github.com/apache/iceberg/issues/8964 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] flink1.13.2+iceberg0.13.0+hive-metastore3.0.0+minio(S3) Forbidden (Service: Amazon S3; Status Code: 403 [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8968: flink1.13.2+iceberg0.13.0+hive-metastore3.0.0+minio(S3) Forbidden (Service: Amazon S3; Status Code: 403 URL: https://github.com/apache/iceberg/issues/8968 -- This is an automated message from the Apache Git Service. To respond to the message, please lo

Re: [PR] fix when equalityFieldColumns is not null and upsert is false, position delete in write function will lead to unstable result if flink checkpoint interval is not same [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9300: URL: https://github.com/apache/iceberg/pull/9300#issuecomment-2412571038 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [I] flink1.13.2+iceberg0.13.0+hive-metastore3.0.0+minio(S3) Forbidden (Service: Amazon S3; Status Code: 403 [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8968: URL: https://github.com/apache/iceberg/issues/8968#issuecomment-2412570467 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Question on BaseMetastoreViewCatalog#buildView [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8967: Question on BaseMetastoreViewCatalog#buildView URL: https://github.com/apache/iceberg/issues/8967 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

Re: [I] Why are updateSchema and UpdatePartitionSpec commit not retried? [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8964: URL: https://github.com/apache/iceberg/issues/8964#issuecomment-2412570423 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [PR] Rest Catalog: Add RESTful data operations [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9237: URL: https://github.com/apache/iceberg/pull/9237#issuecomment-2412570932 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Spark SystemFunctions are not pushed down during JOIN [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9233: Spark SystemFunctions are not pushed down during JOIN URL: https://github.com/apache/iceberg/pull/9233 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [I] Does the Java API support primary keys for creating tables [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8950: Does the Java API support primary keys for creating tables URL: https://github.com/apache/iceberg/issues/8950 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [PR] Rest Catalog: Add RESTful data operations [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9237: Rest Catalog: Add RESTful data operations URL: https://github.com/apache/iceberg/pull/9237 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [PR] Spark: Use Awaitility instead of Thread.sleep [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9224: Spark: Use Awaitility instead of Thread.sleep URL: https://github.com/apache/iceberg/pull/9224 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spec

Re: [I] Long overflow when Iceberg reading INT96 timestamp column from Spark parquet table [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8949: Long overflow when Iceberg reading INT96 timestamp column from Spark parquet table URL: https://github.com/apache/iceberg/issues/8949 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

Re: [PR] Spark: Use Awaitility instead of Thread.sleep [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9224: URL: https://github.com/apache/iceberg/pull/9224#issuecomment-2412570880 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Spark SystemFunctions are not pushed down during JOIN [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9233: URL: https://github.com/apache/iceberg/pull/9233#issuecomment-2412570907 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [I] equality delete files can be removed immediately after rewrite? [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8933: URL: https://github.com/apache/iceberg/issues/8933#issuecomment-2412570345 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [PR] Core: Suppress exceptions in case of dropTableData [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9184: Core: Suppress exceptions in case of dropTableData URL: https://github.com/apache/iceberg/pull/9184 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [PR] Spark: IN clause on system function is not pushed down [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9192: URL: https://github.com/apache/iceberg/pull/9192#issuecomment-2412570842 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Remove redundant error propagation check. [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9143: URL: https://github.com/apache/iceberg/pull/9143#issuecomment-2412570759 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Parquet: Add a table property to control the Parquet row-group size of position delete files [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9177: Parquet: Add a table property to control the Parquet row-group size of position delete files URL: https://github.com/apache/iceberg/pull/9177 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

Re: [PR] Core, Hive, Nessie: Use ResolvingFileIO as default instead of HadoopFileIO [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #8272: URL: https://github.com/apache/iceberg/pull/8272#issuecomment-2412570302 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

Re: [PR] Remove redundant error propagation check. [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9143: Remove redundant error propagation check. URL: https://github.com/apache/iceberg/pull/9143 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

Re: [PR] Added test about CDC creation failure after expiring all the snapshots [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on PR #9141: URL: https://github.com/apache/iceberg/pull/9141#issuecomment-2412570736 This pull request has been closed due to lack of activity. This is not a judgement on the merit of the PR in any way. It is just a way of keeping the PR queue manageable. If y

Re: [PR] Added test about CDC creation failure after expiring all the snapshots [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed pull request #9141: Added test about CDC creation failure after expiring all the snapshots URL: https://github.com/apache/iceberg/pull/9141 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the UR

Re: [I] View is no longer in sync with table after catalog cache entry expires [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8977: View is no longer in sync with table after catalog cache entry expires URL: https://github.com/apache/iceberg/issues/8977 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL abo

Re: [I] Creating a hive Managed Table? [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #9013: Creating a hive Managed Table? URL: https://github.com/apache/iceberg/issues/9013 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To uns

Re: [I] Data duplicate after the partition is modified [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8979: Data duplicate after the partition is modified URL: https://github.com/apache/iceberg/issues/8979 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

Re: [I] org.apache.iceberg.spark.source.SerializableTableWithSize cannot be cast to org.apache.iceberg.Table [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8978: org.apache.iceberg.spark.source.SerializableTableWithSize cannot be cast to org.apache.iceberg.Table URL: https://github.com/apache/iceberg/issues/8978 -- This is an automated message from the Apache Git Service. To respond to the message, please log o

Re: [I] Support MOR CDC view [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8975: Support MOR CDC view URL: https://github.com/apache/iceberg/issues/8975 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe,

Re: [I] Question on BaseMetastoreViewCatalog#buildView [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8967: URL: https://github.com/apache/iceberg/issues/8967#issuecomment-2412570449 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Does the Java API support primary keys for creating tables [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8950: URL: https://github.com/apache/iceberg/issues/8950#issuecomment-2412570398 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] flink:FlinkSink support dynamically changed schema [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #4190: URL: https://github.com/apache/iceberg/issues/4190#issuecomment-2412570266 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Long overflow when Iceberg reading INT96 timestamp column from Spark parquet table [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #8949: URL: https://github.com/apache/iceberg/issues/8949#issuecomment-2412570370 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] equality delete files can be removed immediately after rewrite? [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] closed issue #8933: equality delete files can be removed immediately after rewrite? URL: https://github.com/apache/iceberg/issues/8933 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to g

Re: [I] Flink: Decouple the iceberg integration work from hadoop libraries [iceberg]

2024-10-14 Thread via GitHub
github-actions[bot] commented on issue #3117: URL: https://github.com/apache/iceberg/issues/3117#issuecomment-2412570245 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] EPIC: Rust Based Compaction [iceberg-rust]

2024-10-14 Thread via GitHub
camuel commented on issue #624: URL: https://github.com/apache/iceberg-rust/issues/624#issuecomment-2412544556 Does anyone has any insights on how computation heavy is the compaction workload really? Like on a beefy machine what compaction rate will be possible? Like 1GB/sec? 10GB/sec? A ba

Re: [PR] API, Core: Add scan planning apis to REST Catalog [iceberg]

2024-10-14 Thread via GitHub
rahil-c commented on code in PR #11180: URL: https://github.com/apache/iceberg/pull/11180#discussion_r1800245093 ## core/src/main/java/org/apache/iceberg/rest/RESTFileScanTaskParser.java: ## @@ -0,0 +1,109 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + *

Re: [PR] API, Core: Add scan planning apis to REST Catalog [iceberg]

2024-10-14 Thread via GitHub
rahil-c commented on code in PR #11180: URL: https://github.com/apache/iceberg/pull/11180#discussion_r1800243915 ## core/src/main/java/org/apache/iceberg/RESTPlanningMode.java: ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more con

Re: [PR] API, Core: Add scan planning apis to REST Catalog [iceberg]

2024-10-14 Thread via GitHub
rahil-c commented on code in PR #11180: URL: https://github.com/apache/iceberg/pull/11180#discussion_r1800243915 ## core/src/main/java/org/apache/iceberg/RESTPlanningMode.java: ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more con

Re: [PR] Puffin: Add delete-vector-v1 blob type [iceberg]

2024-10-14 Thread via GitHub
rdblue commented on code in PR #11238: URL: https://github.com/apache/iceberg/pull/11238#discussion_r1800243797 ## format/puffin-spec.md: ## @@ -123,6 +123,49 @@ The blob metadata for this blob may include following properties: - `ndv`: estimate of number of distinct values,

Re: [PR] Puffin: Add delete-vector-v1 blob type [iceberg]

2024-10-14 Thread via GitHub
aokolnychyi commented on code in PR #11238: URL: https://github.com/apache/iceberg/pull/11238#discussion_r1800243303 ## format/puffin-spec.md: ## @@ -123,6 +123,44 @@ The blob metadata for this blob may include following properties: - `ndv`: estimate of number of distinct va

Re: [PR] feat(catalog/glue): add support for list namespaces [iceberg-go]

2024-10-14 Thread via GitHub
oguzerdogmus commented on code in PR #169: URL: https://github.com/apache/iceberg-go/pull/169#discussion_r1800236271 ## catalog/glue.go: ## @@ -150,8 +151,33 @@ func (c *GlueCatalog) UpdateNamespaceProperties(ctx context.Context, namespace t return PropertiesUpdateSumma

Re: [PR] Hive: Use EnvironmentContext instead of Hive Locks to provide transactional commits after HIVE-26882 [iceberg]

2024-10-14 Thread via GitHub
pvary commented on PR #6570: URL: https://github.com/apache/iceberg/pull/6570#issuecomment-2412467877 @chenwyi2: If you backport the changes to Hive 1, then you can use the feature. I suggest to create your own release for Iceberg as well. -- This is an automated message from the Apache G

Re: [I] PyIceberg Near-Term Roadmap [iceberg-python]

2024-10-14 Thread via GitHub
jaehyeon-kim commented on issue #736: URL: https://github.com/apache/iceberg-python/issues/736#issuecomment-2412464577 It look BigLake metastore is going to be replaced to BigQuery metastore. Is the version 0.8.0 still relevant? https://github.com/trinodb/trino/issues/20031#issuecomme

  1   2   3   >