Re: [PR] Manual deduction of partitions [iceberg-python]

2025-03-04 Thread via GitHub
Fokko commented on code in PR #1743: URL: https://github.com/apache/iceberg-python/pull/1743#discussion_r1979125483 ## pyiceberg/io/pyarrow.py: ## @@ -2475,18 +2484,25 @@ def parquet_files_to_data_files(io: FileIO, table_metadata: TableMetadata, file_ f"Cannot

Re: [PR] Manual deduction of partitions [iceberg-python]

2025-03-02 Thread via GitHub
afiodorov commented on PR #1743: URL: https://github.com/apache/iceberg-python/pull/1743#issuecomment-2692884445 Hey! The regex was just an example, it's not part of the API though - the partition deduction function is. The issue I am having at work is that our pipelines keep writi

Re: [PR] Manual deduction of partitions [iceberg-python]

2025-03-01 Thread via GitHub
kevinjqliu commented on PR #1743: URL: https://github.com/apache/iceberg-python/pull/1743#issuecomment-2692351898 Hi @afiodorov thanks for the PR! For adding hive partitioned files to Iceberg, there's a specific way we can do so using column projections, https://iceberg.apache.org/s

[PR] Manual deduction of partitions [iceberg-python]

2025-02-28 Thread via GitHub
afiodorov opened a new pull request, #1743: URL: https://github.com/apache/iceberg-python/pull/1743 I want to a) be able to add files that are partitioned by the filename convention, e.g. s3://bucket/table/year=2025/month=12 b) add files even if they have extra columns without having to mig
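
For readers unfamiliar with the filename convention mentioned in the pull request description, the following is a minimal sketch of how partition values could be read out of a hive-style path such as s3://bucket/table/year=2025/month=12. It is not the code from the PR and not part of the pyiceberg API: the function name `deduce_partition_from_path`, the regex, and the `data.parquet` file name are all assumptions made for illustration, and only the Python standard library is used.

```python
import re
from urllib.parse import urlparse

# Hypothetical pattern for illustration: one "key=value" path segment.
_PARTITION_SEGMENT = re.compile(r"^(?P<key>[^=/]+)=(?P<value>[^/]*)$")


def deduce_partition_from_path(path: str) -> dict[str, str]:
    """Collect hive-style key=value path segments into a dict.

    Hypothetical helper, not a pyiceberg function. Values are returned
    as raw strings; converting them to the partition field types is
    left out of this sketch.
    """
    partition: dict[str, str] = {}
    # urlparse keeps the bucket in .netloc, so only the object key is scanned.
    for segment in urlparse(path).path.split("/"):
        match = _PARTITION_SEGMENT.match(segment)
        if match:
            partition[match.group("key")] = match.group("value")
    return partition


if __name__ == "__main__":
    # The file name is made up; the directory layout follows the example above.
    print(deduce_partition_from_path(
        "s3://bucket/table/year=2025/month=12/data.parquet"
    ))
```

Segments without an `=` (the table directory, the file name) are ignored, so a path with no partition directories yields an empty dict.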