Re: [I] [DISCUSSION] Project Goal [iceberg-cpp]

2025-01-15 Thread via GitHub
MisterRaindrop commented on issue #2: URL: https://github.com/apache/iceberg-cpp/issues/2#issuecomment-2592538698 A little immature idea! The current implementation of Iceberg I am not use Arrow at all. I am currently reading data directly through Parquet, Avro, etc. I would like to ha

Re: [PR] feat: add file_io and local fs impl [iceberg-cpp]

2025-01-15 Thread via GitHub
zhjwpku commented on PR #30: URL: https://github.com/apache/iceberg-cpp/pull/30#issuecomment-2592893837 @wgtmac @Fokko @pitrou @Xuanwo @raulcd Please take a look at this early implementation of FileIO, I plan to add more interfaces, before that, I hope to hear some advices, thanks. -- T

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916250301 ## core/src/test/java/org/apache/iceberg/TestRowLineageMetadata.java: ## @@ -0,0 +1,339 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or mo

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916254927 ## core/src/test/java/org/apache/iceberg/TestRowLineageMetadata.java: ## @@ -0,0 +1,339 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or mo

Re: [I] Partition writes not creating expected directory hierarchy on S3 (MinIO) [iceberg-rust]

2025-01-15 Thread via GitHub
ZENOTME commented on issue #891: URL: https://github.com/apache/iceberg-rust/issues/891#issuecomment-2592146087 > I work with a startup building a distributed query engine for large-scale Iceberg tables (>1PB). The partition-specific hierarchical directory becomes crucial from a query engin

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916251639 ## core/src/test/java/org/apache/iceberg/TestRowLineageMetadata.java: ## @@ -0,0 +1,339 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or mo

Re: [PR] Run `java-ci` on changes in `open-api/**` [iceberg]

2025-01-15 Thread via GitHub
Fokko merged PR #11972: URL: https://github.com/apache/iceberg/pull/11972 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apa

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916256928 ## core/src/main/java/org/apache/iceberg/util/JsonUtil.java: ## @@ -144,6 +144,13 @@ public static long getLong(String property, JsonNode node) { return pNode.as

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916260635 ## core/src/main/java/org/apache/iceberg/BaseSnapshot.java: ## @@ -61,7 +63,9 @@ class BaseSnapshot implements Snapshot { String operation, Map summary,

Re: [PR] Bump Spark to Scala 2.13 [iceberg-python]

2025-01-15 Thread via GitHub
Fokko commented on PR #1515: URL: https://github.com/apache/iceberg-python/pull/1515#issuecomment-2592163686 PySpark comes with 2.12 :( -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specifi

Re: [PR] Bump Spark to Scala 2.13 [iceberg-python]

2025-01-15 Thread via GitHub
Fokko closed pull request #1515: Bump Spark to Scala 2.13 URL: https://github.com/apache/iceberg-python/pull/1515 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916263730 ## core/src/main/java/org/apache/iceberg/MetadataUpdate.java: ## @@ -517,4 +517,11 @@ public void applyTo(ViewMetadata.Builder viewMetadataBuilder) { viewMetad

Re: [PR] Core, API, Spec: Metadata Row Lineage [iceberg]

2025-01-15 Thread via GitHub
nastra commented on code in PR #11948: URL: https://github.com/apache/iceberg/pull/11948#discussion_r1916267686 ## core/src/main/java/org/apache/iceberg/MetadataUpdate.java: ## @@ -517,4 +517,11 @@ public void applyTo(ViewMetadata.Builder viewMetadataBuilder) { viewMetad

Re: [PR] Doc: Add DELETE ORPHAN-FILES example to Hive page [iceberg]

2025-01-15 Thread via GitHub
ebyhr commented on PR #11896: URL: https://github.com/apache/iceberg/pull/11896#issuecomment-2591936229 > Is it available in Hive 4.0.0? Yes. > Do we have other maintenance features, like compaction, manifest rewrite, expire snapshot that are available in Hive as well?

Re: [PR] Core: Fix loading a table in CachingCatalog with metadata table name [iceberg]

2025-01-15 Thread via GitHub
gaborkaszab commented on PR #11738: URL: https://github.com/apache/iceberg/pull/11738#issuecomment-2591938631 Thanks for the reviews @manuzhang @wypoon @pvary @nastra ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use t

Re: [I] [Bug] Error in overwrite(): pyarrow.lib.ArrowInvalid: offset overflow with large dataset (~3M rows) [iceberg-python]

2025-01-15 Thread via GitHub
bigluck commented on issue #1491: URL: https://github.com/apache/iceberg-python/issues/1491#issuecomment-2592657361 We're facing the same problem by using pyiceberg 0.7.1 :( -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

Re: [PR] Build: Bump openapi-generator plugin from 6.6.0 to 7.10.0 [iceberg]

2025-01-15 Thread via GitHub
nastra commented on PR #11970: URL: https://github.com/apache/iceberg/pull/11970#issuecomment-2591950363 merging as CI failure is unrelated -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

Re: [PR] Build: Bump openapi-generator plugin from 6.6.0 to 7.10.0 [iceberg]

2025-01-15 Thread via GitHub
nastra merged PR #11970: URL: https://github.com/apache/iceberg/pull/11970 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.ap

Re: [PR] [1.7.x] AWS, Core, GCP: Support relative credential endpoint / pass OAuth2 token to credential provider [iceberg]

2025-01-15 Thread via GitHub
nastra closed pull request #11971: [1.7.x] AWS, Core, GCP: Support relative credential endpoint / pass OAuth2 token to credential provider URL: https://github.com/apache/iceberg/pull/11971 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

Re: [PR] Doc: Add DELETE ORPHAN-FILES example to Hive page [iceberg]

2025-01-15 Thread via GitHub
pvary merged PR #11896: URL: https://github.com/apache/iceberg/pull/11896 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apa

Re: [PR] Doc: Add DELETE ORPHAN-FILES example to Hive page [iceberg]

2025-01-15 Thread via GitHub
pvary commented on PR #11896: URL: https://github.com/apache/iceberg/pull/11896#issuecomment-2592352317 Merged to main. Thanks for the PR @ebyhr! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] Spark 3.5: Fix empty scan issue when start timestamp retrieves root snapshot and end timestamp is missing [iceberg]

2025-01-15 Thread via GitHub
Fokko commented on code in PR #11967: URL: https://github.com/apache/iceberg/pull/11967#discussion_r1916371869 ## spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkScanBuilder.java: ## @@ -561,14 +561,11 @@ public Scan buildChangelogScan() { boolean empt

Re: [PR] Doc: Add DELETE ORPHAN-FILES example to Hive page [iceberg]

2025-01-15 Thread via GitHub
pvary commented on PR #11896: URL: https://github.com/apache/iceberg/pull/11896#issuecomment-2591901654 @ebyhr: Is it available in Hive 4.0.0? Do we have other maintenance features, like compaction, manifest rewrite, expire snapshot that are available in Hive as well? -- This is an autom

Re: [PR] [WIP]: Data, Flink, Spark: Set PUFFIN as delete file format for V3 [iceberg]

2025-01-15 Thread via GitHub
Fokko commented on code in PR #11588: URL: https://github.com/apache/iceberg/pull/11588#discussion_r1916175700 ## spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkPositionDeltaWrite.java: ## @@ -792,13 +794,14 @@ private static class Context implements Seriali

Re: [PR] Spark 3.4: Add view support to SparkSessionCatalog [iceberg]

2025-01-15 Thread via GitHub
nastra commented on PR #11797: URL: https://github.com/apache/iceberg/pull/11797#issuecomment-2592091291 @danielcweeks I've rebased the PR now, thanks for checking -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the UR

Re: [PR] ci: check MSRV correctly [iceberg-rust]

2025-01-15 Thread via GitHub
Xuanwo merged PR #849: URL: https://github.com/apache/iceberg-rust/pull/849 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.a

Re: [PR] Azure: Support vended credentials refresh in ADLSFileIO. [iceberg]

2025-01-15 Thread via GitHub
nastra commented on PR #11577: URL: https://github.com/apache/iceberg/pull/11577#issuecomment-2592109791 @ChaladiMohanVamsi can you please make sure that the same improvements from https://github.com/apache/iceberg/pull/11954 are also applied here? -- This is an automated message from the

Re: [I] Partition writes not creating expected directory hierarchy on S3 (MinIO) [iceberg-rust]

2025-01-15 Thread via GitHub
ZENOTME commented on issue #891: URL: https://github.com/apache/iceberg-rust/issues/891#issuecomment-2592111636 > Thank you, [@ZENOTME](https://github.com/ZENOTME), for the response. > > I work with a startup building a distributed query engine for large-scale Iceberg tables (>1PB). T

Re: [I] Transient AWS Connection Issues [iceberg]

2025-01-15 Thread via GitHub
steveloughran commented on issue #11412: URL: https://github.com/apache/iceberg/issues/11412#issuecomment-2592960320 Even if you share a single client, the v2 SDK IAM credential provider on EC2 (not k8s) is unable to recover from 503 errors during is refresh, as it tries to refresh the cred

Re: [I] [DISCUSSION] Project Goal [iceberg-cpp]

2025-01-15 Thread via GitHub
lidavidm commented on issue #2: URL: https://github.com/apache/iceberg-cpp/issues/2#issuecomment-2594055888 You could imagine a core library that only does planning/parsing, and doesn't return data at all, though. (Just: read these data files, apply these delete files, etc.) -- This is a

Re: [PR] Core: Parsing and Writing Tests for V3 Metadata [iceberg]

2025-01-15 Thread via GitHub
rdblue commented on code in PR #11947: URL: https://github.com/apache/iceberg/pull/11947#discussion_r1917410896 ## core/src/test/java/org/apache/iceberg/TestDataTaskParser.java: ## @@ -193,10 +193,26 @@ private DataTask createDataTask() { List snapshots = Arrays.

Re: [PR] Core: Parsing and Writing Tests for V3 Metadata [iceberg]

2025-01-15 Thread via GitHub
RussellSpitzer commented on code in PR #11947: URL: https://github.com/apache/iceberg/pull/11947#discussion_r1917401890 ## core/src/test/java/org/apache/iceberg/TestSnapshotJson.java: ## @@ -37,13 +34,16 @@ public class TestSnapshotJson { @Test public void testJsonConver

Re: [PR] Build: Add plugin to generate license and notice files [iceberg]

2025-01-15 Thread via GitHub
RussellSpitzer commented on PR #11977: URL: https://github.com/apache/iceberg/pull/11977#issuecomment-2594075737 I liked the idea that Ryan suggested this morning, where our plugin would always generate and check the generated files against a set of committed existing files and report an er

[PR] Build: Bump mypy-boto3-glue from 1.35.93 to 1.36.0 [iceberg-python]

2025-01-15 Thread via GitHub
dependabot[bot] opened a new pull request, #1522: URL: https://github.com/apache/iceberg-python/pull/1522 Bumps [mypy-boto3-glue](https://github.com/youtype/mypy_boto3_builder) from 1.35.93 to 1.36.0. Release notes Sourced from https://github.com/youtype/mypy_boto3_builder/releases

Re: [PR] Core: Parsing and Writing Tests for V3 Metadata [iceberg]

2025-01-15 Thread via GitHub
RussellSpitzer commented on code in PR #11947: URL: https://github.com/apache/iceberg/pull/11947#discussion_r1917469252 ## core/src/test/java/org/apache/iceberg/MetadataTestUtils.java: ## @@ -0,0 +1,336 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or

Re: [PR] Parquet: Add readers and writers for the internal object model [iceberg]

2025-01-15 Thread via GitHub
ajantha-bhat commented on code in PR #11904: URL: https://github.com/apache/iceberg/pull/11904#discussion_r1917518515 ## parquet/src/main/java/org/apache/iceberg/data/parquet/BaseParquetReaders.java: ## @@ -76,6 +67,31 @@ protected ParquetValueReader createReader( protected a

Re: [I] cannot load table thru glue catalog [iceberg-python]

2025-01-15 Thread via GitHub
xpj01 commented on issue #1501: URL: https://github.com/apache/iceberg-python/issues/1501#issuecomment-2594289664 I made it work locally, but it's not working in my EC2 instance. Anyway I think this may due to some environment issue I can look into deeper. Thanks for the information! @kevi

Re: [I] cannot load table thru glue catalog [iceberg-python]

2025-01-15 Thread via GitHub
xpj01 closed issue #1501: cannot load table thru glue catalog URL: https://github.com/apache/iceberg-python/issues/1501 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsub

Re: [PR] feat: add file_io and local fs impl [iceberg-cpp]

2025-01-15 Thread via GitHub
zhjwpku commented on PR #30: URL: https://github.com/apache/iceberg-cpp/pull/30#issuecomment-2594498304 > For instance, async vs sync: [#2 (comment)](https://github.com/apache/iceberg-cpp/issues/2#issuecomment-2522607394) Can we provide both? > > Or, whether the core librar

Re: [PR] Bump up spark to 3.5.4 [iceberg-python]

2025-01-15 Thread via GitHub
kevinjqliu commented on PR #1521: URL: https://github.com/apache/iceberg-python/pull/1521#issuecomment-2594230922 I dont think we can upgrade to `3.5.4` yet per this comment https://github.com/apache/iceberg-python/pull/1461#issuecomment-2572498850. This will be unblocked with the upcoming

Re: [PR] Core: Parsing and Writing Tests for V3 Metadata [iceberg]

2025-01-15 Thread via GitHub
rdblue commented on code in PR #11947: URL: https://github.com/apache/iceberg/pull/11947#discussion_r1917406478 ## core/src/test/java/org/apache/iceberg/MetadataTestUtils.java: ## @@ -0,0 +1,428 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more co

Re: [PR] Core: Parsing and Writing Tests for V3 Metadata [iceberg]

2025-01-15 Thread via GitHub
rdblue commented on code in PR #11947: URL: https://github.com/apache/iceberg/pull/11947#discussion_r1917406002 ## core/src/test/java/org/apache/iceberg/MetadataTestUtils.java: ## @@ -0,0 +1,336 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more co

Re: [PR] Spark 3.5: Fix flaky tests `withSnapshotIsolation` [iceberg]

2025-01-15 Thread via GitHub
RussellSpitzer commented on PR #11974: URL: https://github.com/apache/iceberg/pull/11974#issuecomment-2594078003 Doesn't this mean that our test is allowed to run 1280 seconds before failing? -- This is an automated message from the Apache Git Service. To respond to the message, please lo

Re: [PR] Spec: Document Snapshot Summary Optional Fields for Standardization [iceberg]

2025-01-15 Thread via GitHub
HonahX commented on code in PR #11660: URL: https://github.com/apache/iceberg/pull/11660#discussion_r1917473314 ## format/spec.md: ## @@ -1633,3 +1633,50 @@ might indicate different snapshot IDs for a specific timestamp. The discrepancie When processing point in time queries

Re: [I] Generate iceberg metadata file based on _spark_metadata [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9270: Generate iceberg metadata file based on _spark_metadata URL: https://github.com/apache/iceberg/issues/9270 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

Re: [I] The query result of `col > x` may be incorrect when there are NaN values in the column `col` [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9130: The query result of `col > x` may be incorrect when there are NaN values in the column `col` URL: https://github.com/apache/iceberg/issues/9130 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to Git

Re: [I] Cannot delete files cleanly with CatalogUtil::dropTableData [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9164: Cannot delete files cleanly with CatalogUtil::dropTableData URL: https://github.com/apache/iceberg/issues/9164 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

Re: [I] Spark Rewrite Write Performance [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9293: URL: https://github.com/apache/iceberg/issues/9293#issuecomment-2594192525 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Spark Rewrite Write Performance [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9293: Spark Rewrite Write Performance URL: https://github.com/apache/iceberg/issues/9293 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To un

Re: [I] Spec Inconsistency: Avro type mapping is incomplete [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9307: Spec Inconsistency: Avro type mapping is incomplete URL: https://github.com/apache/iceberg/issues/9307 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

Re: [I] FileNotFoundException can occur in some scenarios. (data file & HADOOP CATALOG) [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9327: URL: https://github.com/apache/iceberg/issues/9327#issuecomment-2594192645 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] `org.apache.iceberg.actions.RewriteDataFiles` implementation for Apache Flink [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9306: URL: https://github.com/apache/iceberg/issues/9306#issuecomment-2594192568 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Considering adjust the default row-group size of Parquet position delete file [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9149: URL: https://github.com/apache/iceberg/issues/9149#issuecomment-2594192180 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Spec Inconsistency: Avro type mapping is incomplete [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9307: URL: https://github.com/apache/iceberg/issues/9307#issuecomment-2594192589 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [PR] API: New API For sequential / streaming updates [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on PR #9323: URL: https://github.com/apache/iceberg/pull/9323#issuecomment-2594192610 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

Re: [I] position delete in BaseEqualityDeltaWriter write function will lead to unstable result when equalityFieldColumns is not null and upsert is false [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9299: position delete in BaseEqualityDeltaWriter write function will lead to unstable result when equalityFieldColumns is not null and upsert is false URL: https://github.com/apache/iceberg/issues/9299 -- This is an automated message from the Apache Git Serv

Re: [I] The snapshots_id is not found in the table.snapshots [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9140: URL: https://github.com/apache/iceberg/issues/9140#issuecomment-2594192134 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Create Branches / TAGS between 2 snapshots [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9281: Create Branches / TAGS between 2 snapshots URL: https://github.com/apache/iceberg/issues/9281 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific co

Re: [I] metadata json conflict when streaming [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9171: metadata json conflict when streaming URL: https://github.com/apache/iceberg/issues/9171 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

Re: [I] Unclosed input streams when writing with high throughput [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9148: URL: https://github.com/apache/iceberg/issues/9148#issuecomment-2594192158 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Hive: Unwrap RuntimeException for Hive InvalidOperationException [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9289: URL: https://github.com/apache/iceberg/issues/9289#issuecomment-2594192502 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] FileNotFoundException can occur in some scenarios. (data file & HADOOP CATALOG) [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9327: FileNotFoundException can occur in some scenarios. (data file & HADOOP CATALOG) URL: https://github.com/apache/iceberg/issues/9327 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use t

Re: [I] flink programs sometimes fail to write to icebergTable. The.avro file in metadata cannot be found [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9168: flink programs sometimes fail to write to icebergTable. The.avro file in metadata cannot be found URL: https://github.com/apache/iceberg/issues/9168 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to

Re: [I] Querying metadata tables for a branch or tag [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9279: URL: https://github.com/apache/iceberg/issues/9279#issuecomment-2594192445 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Allow for - in Glue Catalog DB/Table names [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9169: URL: https://github.com/apache/iceberg/issues/9169#issuecomment-2594192274 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Hive: Unwrap RuntimeException for Hive InvalidOperationException [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9289: Hive: Unwrap RuntimeException for Hive InvalidOperationException URL: https://github.com/apache/iceberg/issues/9289 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] `org.apache.iceberg.actions.RewriteDataFiles` implementation for Apache Flink [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9306: `org.apache.iceberg.actions.RewriteDataFiles` implementation for Apache Flink URL: https://github.com/apache/iceberg/issues/9306 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] feat: add file_io and local fs impl [iceberg-cpp]

2025-01-15 Thread via GitHub
lidavidm commented on code in PR #30: URL: https://github.com/apache/iceberg-cpp/pull/30#discussion_r1917497458 ## src/iceberg/io/file_io.h: ## @@ -0,0 +1,78 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one + * or more contributor license agreements. See th

Re: [PR] Parquet: Add readers and writers for the internal object model [iceberg]

2025-01-15 Thread via GitHub
rdblue commented on code in PR #11904: URL: https://github.com/apache/iceberg/pull/11904#discussion_r1917500263 ## parquet/src/main/java/org/apache/iceberg/data/parquet/BaseParquetWriter.java: ## @@ -50,6 +42,27 @@ protected ParquetValueWriter createWriter(MessageType type) {

Re: [PR] Build: Add plugin to generate license and notice files [iceberg]

2025-01-15 Thread via GitHub
bryanck commented on PR #11977: URL: https://github.com/apache/iceberg/pull/11977#issuecomment-2594424405 > I liked the idea that Ryan suggested this morning, where our plugin would always generate and check the generated files against a set of committed existing files and report an error i

Re: [PR] Spark: Fix empty scan issue when start timestamp retrieves root snapshot and end timestamp is missing [iceberg]

2025-01-15 Thread via GitHub
flyrain commented on code in PR #11967: URL: https://github.com/apache/iceberg/pull/11967#discussion_r1917634187 ## spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/SparkScanBuilder.java: ## @@ -561,14 +561,11 @@ public Scan buildChangelogScan() { boolean em

Re: [PR] Core: Parsing and Writing Tests for V3 Metadata [iceberg]

2025-01-15 Thread via GitHub
RussellSpitzer commented on code in PR #11947: URL: https://github.com/apache/iceberg/pull/11947#discussion_r1917405165 ## core/src/test/java/org/apache/iceberg/TestTableMetadata.java: ## @@ -160,31 +155,16 @@ public void testJsonConversion() throws Exception {

Re: [PR] Parquet: Add readers and writers for the internal object model [iceberg]

2025-01-15 Thread via GitHub
rdblue commented on code in PR #11904: URL: https://github.com/apache/iceberg/pull/11904#discussion_r1917496847 ## parquet/src/main/java/org/apache/iceberg/data/parquet/InternalWriter.java: ## @@ -0,0 +1,72 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one +

Re: [I] Spark: Session level Iceberg table config defaults [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #2270: URL: https://github.com/apache/iceberg/issues/2270#issuecomment-2594191333 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Spark: read migrate table error [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #2896: URL: https://github.com/apache/iceberg/issues/2896#issuecomment-2594191359 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Flink: Decouple the iceberg integration work from hadoop libraries [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #3117: Flink: Decouple the iceberg integration work from hadoop libraries URL: https://github.com/apache/iceberg/issues/3117 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

Re: [I] Flink: Decouple the iceberg integration work from hadoop libraries [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #3117: URL: https://github.com/apache/iceberg/issues/3117#issuecomment-2594191394 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] How to move Iceberg table from one location to another [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #3142: URL: https://github.com/apache/iceberg/issues/3142#issuecomment-2594191413 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [PR] Encryption integration and test [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on PR #5544: URL: https://github.com/apache/iceberg/pull/5544#issuecomment-2594191536 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

Re: [PR] API,Core: Introduce metrics for data files by file format [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on PR #5837: URL: https://github.com/apache/iceberg/pull/5837#issuecomment-2594191590 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

Re: [I] Partitions metadata table shows old partitions [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #6257: URL: https://github.com/apache/iceberg/issues/6257#issuecomment-2594191633 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] insert to hive table with icberg table format is failing [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #7840: URL: https://github.com/apache/iceberg/issues/7840#issuecomment-2594191736 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] spark v3.2 RoaringBitMap dependency is causing build failure [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #7909: URL: https://github.com/apache/iceberg/issues/7909#issuecomment-2594191766 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Can not parse sort order when there are more than one sql extensions specified and iceberg extension is not placed at the right most position [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #8004: URL: https://github.com/apache/iceberg/issues/8004#issuecomment-2594191789 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [PR] Parquet: Add readers and writers for the internal object model [iceberg]

2025-01-15 Thread via GitHub
rdblue commented on code in PR #11904: URL: https://github.com/apache/iceberg/pull/11904#discussion_r1917497895 ## parquet/src/main/java/org/apache/iceberg/data/parquet/InternalWriter.java: ## @@ -0,0 +1,72 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one +

Re: [I] Why Iceberg do not support column with default value? [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #8527: URL: https://github.com/apache/iceberg/issues/8527#issuecomment-2594191853 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Time Travel / Snapshot Performance [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9112: Time Travel / Snapshot Performance URL: https://github.com/apache/iceberg/issues/9112 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

Re: [I] Decouple modules Hive 2 and 3 Versions and make Hive 3 as default [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9113: Decouple modules Hive 2 and 3 Versions and make Hive 3 as default URL: https://github.com/apache/iceberg/issues/9113 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] Time Travel / Snapshot Performance [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9112: URL: https://github.com/apache/iceberg/issues/9112#issuecomment-2594191997 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] TBLPROPERTIES('history.expire.max-snapshot-age-ms') doesn't work [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9123: TBLPROPERTIES('history.expire.max-snapshot-age-ms') doesn't work URL: https://github.com/apache/iceberg/issues/9123 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] Decouple modules Hive 2 and 3 Versions and make Hive 3 as default [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9113: URL: https://github.com/apache/iceberg/issues/9113#issuecomment-2594192014 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] java.lang.ClassNotFoundException: Failed to find data source: iceberg. Issue when we are using Java Custom Catalog [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9275: URL: https://github.com/apache/iceberg/issues/9275#issuecomment-2594192391 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [PR] Core: Avro writers use BlockingBinaryEncoder to enable array/map size calculations. [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on PR #8625: URL: https://github.com/apache/iceberg/pull/8625#issuecomment-2594191886 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

Re: [PR] Spark: support rewrite on specified target branch [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on PR #8797: URL: https://github.com/apache/iceberg/pull/8797#issuecomment-2594191922 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

Re: [I] iceberg reports an error after upgrading to 1.4.2 [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] closed issue #9018: iceberg reports an error after upgrading to 1.4.2 URL: https://github.com/apache/iceberg/issues/9018 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

Re: [I] iceberg reports an error after upgrading to 1.4.2 [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #9018: URL: https://github.com/apache/iceberg/issues/9018#issuecomment-2594191969 This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale' -- This is an automated message from the Apache Gi

Re: [I] Spark: Support alter partition in V2 Catalog SparkCatalog Class [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #3558: URL: https://github.com/apache/iceberg/issues/3558#issuecomment-2594191454 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Encryption via RewriteFiles [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #3928: URL: https://github.com/apache/iceberg/issues/3928#issuecomment-2594191497 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [I] Integrate pyiceberg with Dask [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on issue #5800: URL: https://github.com/apache/iceberg/issues/5800#issuecomment-2594191567 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs.

Re: [PR] API,Core: Support Conditional Commits [iceberg]

2025-01-15 Thread via GitHub
github-actions[bot] commented on PR #6513: URL: https://github.com/apache/iceberg/pull/6513#issuecomment-2594191655 This pull request has been marked as stale due to 30 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull

  1   2   3   >