RussellSpitzer commented on code in PR #12931:
URL: https://github.com/apache/iceberg/pull/12931#discussion_r2067376061
##########
docs/docs/spark-configuration.md:
##########
@@ -145,6 +145,59 @@ Using those SQL commands requires adding Iceberg
extensions to your Spark enviro
## Runtime configuration
+### Precedence of Configuration Settings
+Iceberg allows configurations to be specified at different levels. The
effective configuration for a read or write operation is determined based on
the following order of precedence:
+
+1. Read/Write Options – Explicitly passed to `.option(...)` in a read/write
operation.
+
+2. Table Properties – Defined on the Iceberg table via `ALTER TABLE SET
TBLPROPERTIES`.
+
+3. Spark SQL Configurations – Set globally in Spark via `spark.conf.set(...)`,
`spark-defaults.conf`, or `--conf` in spark-submit.
+
+If a setting is not defined at a higher level, the next level is used as
fallback. This allows flexibility while enabling global defaults when needed.
+
+### Spark SQL Options
+
+Iceberg supports setting various global behaviors using Spark SQL
configuration options. These can be set via `spark.conf`, `SparkSession
settings`, or Spark submit arguments.
+For example:
+
+```scala
+// disabling vectorization
+val spark = SparkSession.builder()
+ .appName("IcebergExample")
+ .master("local[*]")
+ .config("spark.sql.catalog.my_catalog",
"org.apache.iceberg.spark.SparkCatalog")
+ .config("spark.sql.extensions",
"org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
+ .config("spark.sql.iceberg.vectorization.enabled", "false")
+ .getOrCreate()
+```
+
+| Spark option | Default
| Description |
+|--------------------------------------------------------|------------------------|-------------------------------------------------------------------------|
+| spark.sql.iceberg.vectorization.enabled | Table property
default | Enables vectorized Parquet reads
|
Review Comment:
Table property default => Table default?
That pattern is used elsewhere
I think this property doesn't just apply to "Parquet" as well so the
description probably needs to be changed. "Enables vectorized reads of data
files"?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]