sheveg opened a new issue, #9958: URL: https://github.com/apache/iceberg/issues/9958
### Apache Iceberg version 1.4.3 ### Query engine Trino ### Please describe the bug 🐞 When using Flink to put data in iceberg using Nessie as catalog, I get an Metadata not found in metadata location for table error when trying to query the data generated by flink. When I query the data generated by trino itself using its CLI, then I do not have this problem. The iceberg connector configurations is: ``` iceberg: |- connector.name=iceberg iceberg.catalog.type=nessie iceberg.nessie-catalog.uri=http://nessie.nessie.svc.cluster.local:19120/api/v1 iceberg.nessie-catalog.default-warehouse-dir=s3://steadops-playground-bucket iceberg.nessie-catalog.ref=main iceberg.register-table-procedure.enabled=true iceberg.file-format=PARQUET hive.s3.path-style-access=true hive.s3.endpoint=http://minio.minio.svc.cluster.local:80 hive.s3.aws-access-key=${ENV:MINIO_ACCESS_KEY} hive.s3.aws-secret-key=${ENV:MINIO_SECRET_KEY} hive.s3.ssl.enabled=false ``` When I create the data using the trino CLI, the .parquet file is named in the schema: <date>_<time>_<uuid>.parquet whereas the data files generated by flink are named as 00000_0_<uuid>.parquet. The metadata files are named according to the same name schema as far as I see: <seq_number>_.metadata.json. The only difference I see, that the metadata directory for flink does not contain any *.stats files. Cross reference to issue in trino repository: https://github.com/trinodb/trino/issues/21007 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org