github-actions[bot] commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2840513172
This issue has been automatically marked as stale because it has been open
for 180 days with no activity. It will be closed in the next 14 days if no further
activity occu
corleyma commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2450705382
@kevinjqliu alas it's not as simple for iceberg because of the need to do
field id-based projection to handle schema evolution.
Somewhat relatedly: from what I remember,
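
To illustrate why field IDs matter for the projection mentioned above, here is a minimal sketch in plain Python. It is not PyIceberg's implementation; `Field`, `project_by_field_id`, and the sample schemas are hypothetical names made up for this example. It assumes a column was renamed and another was added after the data file was written, so columns can only be matched by ID, and the new column has to be filled with nulls.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Field:
    field_id: int
    name: str


def project_by_field_id(table_schema, file_schema, rows):
    """Reshape rows read from one data file to the current table schema."""
    # Name each field ID had at the time the file was written.
    file_names = {f.field_id: f.name for f in file_schema}
    projected = []
    for row in rows:
        out = {}
        for f in table_schema:
            old_name = file_names.get(f.field_id)
            # A field that did not exist when the file was written reads as null.
            out[f.name] = row[old_name] if old_name is not None else None
        projected.append(out)
    return projected


# Hypothetical evolution: "ts" was renamed to "event_time", "country" was added.
old_file_schema = [Field(1, "id"), Field(2, "ts")]
current_schema = [Field(1, "id"), Field(2, "event_time"), Field(3, "country")]

rows = [{"id": 7, "ts": "2024-01-01"}]
result = project_by_field_id(current_schema, old_file_schema, rows)
print(result)
```

A name-based read would miss `event_time` entirely for this file, which is the reason a dataset built directly from file paths is not enough on its own.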
kevinjqliu commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2450455606
Reference to delta table's `to_pyarrow_dataset` implementation
https://github.com/delta-io/delta-rs/blob/3f355d87119661fc7cf28877b620b589277ba1d1/python/deltalake/table.py#L
noklam commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2374541290
Would love this feature, I am coming from
https://github.com/ibis-project/ibis/issues/7712 as well.
--
This is an automated message from the Apache Git Service.
To respond to the
TiansuYu commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-2306607034
> A PyArrow Dataset can be initiated from a list of file paths:
>
> Create a FileSystemDataset from explicitly given files. The files must be
> located on the same filesystem
mfatihaktas commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1971894322
Just to note, we would also love this feature. It would allow us to [support
Iceberg read/write in Ibis](https://github.com/ibis-project/ibis/pull/8343).
wonnor-pro commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1939097823
Hi team, do we have an update on this? We are really excited about this
feature.
Fokko commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922231907
I'm always in for [creative
solutions](https://github.com/pola-rs/polars/pull/10375). I think that would
work well, also my colleague did something similar:
https://gist.github.c
jwills commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922172352
That is helpful, thank you.
One other option I was considering on my side, given that I have access to
https://github.com/duckdb/duckdb_iceberg : Using pyiceberg to fetch the
Fokko commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922123177
Just for context, in case it helps: I was recently experimenting with pushing
the union of the tables into Arrow, including all the schema evolution. This
would prevent PyIceberg from
jwills commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922103186
@Fokko agreed, the union approach seems like a perf killer. Will noodle on
this a bit more-- thanks for the context here!
Fokko commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922085729
Hey @jwills, having a union dataset feels like a step in the right direction
to me; however, I don't think it will really help when it comes to performance.
Loading the files th
stefnba commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1922014952
Maybe this helps.
A PyArrow Dataset can be initiated from a list of file paths:
Create a FileSystemDataset from explicitly given files. The files must be
located on t
Fokko commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921815721
Awesome, let me know if there are any questions. Happy to provide context
jwills commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921814204
sg @Fokko, will dive in here
Fokko commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921812965
Hey @jwills I think many folks are looking forward to this, so it would be
great if you would be willing to spend time on getting this in 🙌
jwills commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921794721
Yo, just chiming in that we would love this for dbt-duckdb use cases--
thanks!
jwills commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1921795696
(If this is a thing I can add, please lmk- I can be surprisingly useful)
stefnba commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1881808334
Hi, is there any update on this topic? Thanks.
RahulDubey391 commented on issue #30:
URL: https://github.com/apache/iceberg-python/issues/30#issuecomment-1827142768
Hi @Fokko , I can have a look into the issue!