This is an automated email from the ASF dual-hosted git repository.
dongjoon pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 51973aa4c7b7 [SPARK-45423][SQL] Lower `ParquetWriteSupport` log level
to debug
51973aa4c7b7 is described below
commit 51973aa4c7b7d6b1b0b98b943f4cf78e27475519
Author: Dongjoon Hyun <[email protected]>
AuthorDate: Thu Oct 5 10:21:53 2023 -0700
[SPARK-45423][SQL] Lower `ParquetWriteSupport` log level to debug
### What changes were proposed in this pull request?
This PR aims to lower `ParquetWriteSupport` log level from INFO to DEBUG
### Why are the changes needed?
Currently, `ParquetWriteSupport` is too verbose at INFO level because it
dumps the Parquet file schema per file. Since this is the only log in
`ParquetWriteSupport`, the users can see this via a proper `log4j2.properties`
setting when they want to debug jobs.
```
23/10/05 16:29:43 INFO ParquetOutputFormat: ParquetRecordWriter [block
size: 134217728b, row group padding size: 8388608b, validating: false]
23/10/05 16:29:43 INFO ParquetWriteSupport: Initialized Parquet
WriteSupport with Catalyst schema:
{
"type" : "struct",
"fields" : [ {
"name" : "id",
"type" : "long",
"nullable" : false,
"metadata" : { }
} ]
}
and corresponding Parquet message type:
message spark_schema {
required int64 id;
}
23/10/05 16:29:43 INFO MagicCommitTracker: ...
```
### Does this PR introduce _any_ user-facing change?
No.
### How was this patch tested?
Manual tests.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closes #43230 from dongjoon-hyun/SPARK-45423.
Authored-by: Dongjoon Hyun <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
---
.../spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git
a/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
b/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
index f6248d43c48e..9535bbd585bc 100644
---
a/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
+++
b/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
@@ -132,7 +132,7 @@ class ParquetWriteSupport extends WriteSupport[InternalRow]
with Logging {
}
}
- logInfo(
+ logDebug(
s"""Initialized Parquet WriteSupport with Catalyst schema:
|${schema.prettyJson}
|and corresponding Parquet message type:
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]