This is an automated email from the ASF dual-hosted git repository.
gurwls223 pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new df1280cb10e [MINOR][DOCS] Correct Python Spark Connect documentation about pip installation
df1280cb10e is described below
commit df1280cb10ee71ea362a95705f355402e2bcaff2
Author: Hyukjin Kwon <[email protected]>
AuthorDate: Tue Nov 21 14:38:24 2023 +0900
[MINOR][DOCS] Correct Python Spark Connect documentation about pip installation
### What changes were proposed in this pull request?
This PR fixes the Spark Connect documentation from `pyspark==3.5.0` to
`pyspark[connect]==3.5.0`; otherwise it will fail to execute the example as is
because of missing dependencies. This is sort of a followup of SPARK-44867.
https://github.com/apache/spark/blob/d31c8596cd714766892d1395e30358bd1cd3cb84/python/setup.py#L325-L332
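For readers who want to confirm whether an environment already has what the `connect` extra installs, the following is a minimal sketch, not part of the patch. The module names in `CONNECT_MODULES` are an assumption about the import names of the extra's dependencies (the authoritative list is in the `setup.py` linked above), and `missing_modules` is a hypothetical helper introduced only for illustration.

```python
import importlib.util

# Assumed import names for packages installed by `pyspark[connect]`.
# This list is illustrative; check python/setup.py for the real requirements.
CONNECT_MODULES = ["pandas", "pyarrow", "grpc", "grpc_status", "numpy"]


def missing_modules(names):
    """Return the subset of `names` that cannot be located for import."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised when a parent package is absent or a module spec is unset.
            found = False
        if not found:
            missing.append(name)
    return missing


if __name__ == "__main__":
    gaps = missing_modules(CONNECT_MODULES)
    if gaps:
        print("Missing modules:", ", ".join(gaps))
        print("Consider: pip install 'pyspark[connect]==3.5.0'")
    else:
        print("All assumed Spark Connect dependencies were found.")
```

If the script reports missing modules after a plain `pip install pyspark==3.5.0`, that is consistent with the failure this PR describes.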
### Why are the changes needed?
To guide users correctly on installing PySpark for use with Spark Connect.
### Does this PR introduce _any_ user-facing change?
Yes, this fixes the user-facing documentation for Python Spark Connect.
### How was this patch tested?
Manually checked with a Markdown editor.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closes #43919 from HyukjinKwon/SPARK-44867-followup.
Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
---
docs/spark-connect-overview.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/docs/spark-connect-overview.md b/docs/spark-connect-overview.md
index c7bad0994a8..166497d47b2 100644
--- a/docs/spark-connect-overview.md
+++ b/docs/spark-connect-overview.md
@@ -279,11 +279,11 @@ The connection may also be programmatically created using _SparkSession#builder_
<div data-lang="python" markdown="1">
-First, install PySpark with `pip install pyspark==3.5.0` or if building a packaged PySpark application/library,
+First, install PySpark with `pip install pyspark[connect]==3.5.0` or if building a packaged PySpark application/library,
add it your setup.py file as:
{% highlight python %}
install_requires=[
-'pyspark==3.5.0'
+'pyspark[connect]==3.5.0'
]
{% endhighlight %}