ischwart1 commented on issue #408: URL: https://github.com/apache/iceberg-python/issues/408#issuecomment-2525291565
@djouallah Apparently you can pass a duckdb connection to `to_duckdb()`, that way you can chain many table together: ```python with ( catalog.load_table(f"{config.lake.namespace}.{config.bronze.map_id}") .scan() .to_duckdb("bronze_map_id") as con1 ): with ( catalog.load_table(f"{config.lake.namespace}.{config.bronze.mob}") .scan() .to_duckdb("bronze_mob", con1) as con2 ): con2.sql("""--sql with mob as ( select unnest(mobs).link as link from bronze_map_id ), links as ( select regexp_extract(link, '/mob/(\\d+)', ['mob_id']).mob_id as mob_id, count(*) as count from mob group by link ) select links.mob_id, name, count from links join bronze_mob on links.mob_id = bronze_mob.id """).show(max_width=1000) ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org