Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2025-04-29 Thread via GitHub
github-actions[bot] commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2840513172 This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occu

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-10-31 Thread via GitHub
corleyma commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2450705382 @kevinjqliu alas it's not as simple for iceberg because of the need to do field id-based projection to handle schema evolution. Somewhat relatedly: from what I remember,

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-10-31 Thread via GitHub
kevinjqliu commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2450455606 Reference to delta table's `to_pyarrow_dataset` implementation https://github.com/delta-io/delta-rs/blob/3f355d87119661fc7cf28877b620b589277ba1d1/python/deltalake/table.py#L

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-09-25 Thread via GitHub
noklam commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2374541290 Would love this feature, I am coming from https://github.com/ibis-project/ibis/issues/7712 as well. -- This is an automated message from the Apache Git Service. To respond to the

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-08-23 Thread via GitHub
TiansuYu commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2306607034 > A PyArrow Dataset can be initiated from a list of file paths: > > Create a FileSystemDataset from explicitly given files. The files must be located on the same filesystem

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-29 Thread via GitHub
mfatihaktas commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1971894322 Just to note, we would also love this feature. It would allow us to [support Iceberg read/write in Ibis](https://github.com/ibis-project/ibis/pull/8343). -- This is an autom

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-12 Thread via GitHub
wonnor-pro commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1939097823 Hi team, do we have an update on this? We are really excited with this feature. -- This is an automated message from the Apache Git Service. To respond to the message, please

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
Fokko commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922231907 I'm always in for [in for creative solutions](https://github.com/pola-rs/polars/pull/10375). I think that would well, also my colleague did something similar: https://gist.github.c

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
jwills commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922172352 That is helpful, thank you. One other option I was considering on my side, given that I have access to https://github.com/duckdb/duckdb_iceberg : Using pyiceberg to fetch the

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
Fokko commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922123177 Just for context, don't know if it helps. I was recently playing by pushing the union of the tables into Arrow, including all the schema evolution. This would prevent PyIceberg from

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
jwills commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922103186 @Fokko agreed, the union approach seems like a perf killer. Will noodle on this a bit more-- thanks for the context here! -- This is an automated message from the Apache Git Serv

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
Fokko commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922085729 Hey @jwills Having a union dataset feels like a step in the right direction to me, however I don't think it will really help when it comes to performance. Loading the files th

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
stefnba commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922014952 Maybe this helps. A PyArrow Dataset can be initiated from a list of file paths: Create a FileSystemDataset from explicitly given files. The files must be located on t

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
Fokko commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921815721 Awesome, let me know if there are any questions. Happy to provide context -- This is an automated message from the Apache Git Service. To respond to the message, please log on to G

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
jwills commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921814204 sg @Fokko, will dive in here -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the spe

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
Fokko commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921812965 Hey @jwills I think many folks are looking forward to this, so it would be great if you would be willing to spend time on getting this in 🙌 -- This is an automated message from t

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
jwills commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921794721 Yo, just chiming in that we would love this for dbt-duckdb use cases-- thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-02-01 Thread via GitHub
jwills commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921795696 (If this is a thing I can add, please lmk- I can be surprisingly useful) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to G

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2024-01-08 Thread via GitHub
stefnba commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1881808334 Hi, is there any update on this topic? Thanks. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL ab

Re: [I] Expose PyIceberg table as PyArrow Dataset [iceberg-python]

2023-11-26 Thread via GitHub
RahulDubey391 commented on issue #30: URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1827142768 Hi @Fokko , I can have a look into the issue! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the UR