GitHub user andr-sokolov added a comment to the discussion: [Proposal] Iceberg subsystem for datalake_fdw — design proposal
**Comparing the performance of Greenplum, Starrocks and Trino when reading Iceberg tables** _Greenplum cluster for the test_ The cluster consist of 5 hosts: master, standby and 3 segment hosts. Each host has 4 CPU cores and 16GB RAM. There are 4 primaries on each segment host - one primary per CPU core. The same equipment has been used to run TPC-H queries on Trino и StarRocks. I used TEA (https://github.com/lithium-tech/tea) to read Iceberg tables from Greenplum 6. _Results_ <img width="1280" height="625" alt="1" src="https://github.com/user-attachments/assets/ba2a6703-22c0-409f-959f-47e87f58a930" /> The horizontal axis shows the numbers of the TPC-H test queries, and the vertical axis shows their execution time in seconds. You can find out the exact numbers in the attached [html](https://github.com/user-attachments/files/27520864/11.html). The dark red color means that the query failed. There are `explain analyze verbose`-s for each query in [this file](https://github.com/user-attachments/files/27520800/explain.txt). Greenplum executed only the q06 query faster, other queries are executed significantly slower. GitHub link: https://github.com/apache/cloudberry/discussions/1683#discussioncomment-16852933 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
