kevinjqliu commented on code in PR #11845: URL: https://github.com/apache/iceberg/pull/11845#discussion_r1896903736
########## site/docs/spark-quickstart.md: ########## @@ -267,44 +271,109 @@ To read a table, simply use the Iceberg table's name. df = spark.table("demo.nyc.taxis").show() ``` -### Adding A Catalog +### Adding catalogs -Iceberg has several catalog back-ends that can be used to track tables, like JDBC, Hive MetaStore and Glue. -Catalogs are configured using properties under `spark.sql.catalog.(catalog_name)`. In this guide, -we use JDBC, but you can follow these instructions to configure other catalog types. To learn more, check out -the [Catalog](docs/latest/spark-configuration.md#catalogs) page in the Spark section. +Apache Iceberg provides several catalog implementations to manage tables and enable SQL operations. +Catalogs are configured using properties under `spark.sql.catalog.(catalog_name)`. +You can configure different catalog types, such as JDBC, Hive Metastore, Glue, and REST, to manage Iceberg tables in Spark. -This configuration creates a path-based catalog named `local` for tables under `$PWD/warehouse` and adds support for Iceberg tables to Spark's built-in catalog. +This guide covers the configuration of two popular catalog types: + +* JDBC Catalog +* REST Catalog + +To learn more, check out the [Catalog](docs/latest/spark-configuration.md#catalogs) page in the Spark section. + +#### Configuring JDBC Catalog + +The JDBC catalog stores Iceberg table metadata in a relational database. + +This configuration creates a JDBC-based catalog named `local` for tables under `$PWD/warehouse` and adds support for Iceberg tables to Spark's built-in catalog. + +The JDBC catalog uses file-based SQLite database as the backend. === "CLI" ```sh - spark-sql --packages org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:{{ icebergVersion }}\ + spark-sql --packages org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:{{ icebergVersion }},org.xerial:sqlite-jdbc:3.46.1.3 \ Review Comment: taking on this extra dep since i dont see any iceberg specific package i can use. there is a [`hive-jdbc`](https://mvnrepository.com/artifact/org.apache.hive/hive-jdbc) package -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org