This is an automated email from the ASF dual-hosted git repository.
gurwls223 pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 97597ba5284 [SPARK-45406][PYTHON][CONNECT] Delete `schema` from
DataFrame constructor
97597ba5284 is described below
commit 97597ba52842c6fcda5db27171439f9c02c1a782
Author: Ruifeng Zheng <[email protected]>
AuthorDate: Wed Oct 4 14:32:31 2023 +0900
[SPARK-45406][PYTHON][CONNECT] Delete `schema` from DataFrame constructor
### What changes were proposed in this pull request?
Delete `schema` from DataFrame constructor
### Why are the changes needed?
this `schema` field was originally designed to cache the schema, however,
it is never used and we won't cache schema anymore
### Does this PR introduce _any_ user-facing change?
No
### How was this patch tested?
CI
### Was this patch authored or co-authored using generative AI tooling?
No
Closes #43206 from zhengruifeng/connect_df_remove_schema.
Authored-by: Ruifeng Zheng <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
---
python/pyspark/sql/connect/dataframe.py | 2 --
1 file changed, 2 deletions(-)
diff --git a/python/pyspark/sql/connect/dataframe.py
b/python/pyspark/sql/connect/dataframe.py
index 91254044a5c..b0f229a8a43 100644
--- a/python/pyspark/sql/connect/dataframe.py
+++ b/python/pyspark/sql/connect/dataframe.py
@@ -103,10 +103,8 @@ class DataFrame:
def __init__(
self,
session: "SparkSession",
- schema: Optional[StructType] = None,
):
"""Creates a new data frame"""
- self._schema = schema
self._plan: Optional[plan.LogicalPlan] = None
self._session: "SparkSession" = session
# Check whether _repr_html is supported or not, we use it to avoid
calling RPC twice
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]