This is an automated email from the ASF dual-hosted git repository.
dongjoon pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 2a5d03a00a03 [SPARK-53854][PYTHON][TESTS] Skip `test_collect_time`
test if pandas or pyarrow are unavailable
2a5d03a00a03 is described below
commit 2a5d03a00a0301b1bdc9af347365d94da15825a9
Author: Dongjoon Hyun <[email protected]>
AuthorDate: Thu Oct 9 00:05:57 2025 -0700
[SPARK-53854][PYTHON][TESTS] Skip `test_collect_time` test if pandas or
pyarrow are unavailable
### What changes were proposed in this pull request?
This PR aims to skip `test_collect_time` test if pandas or pyarrow are
unavailable.
### Why are the changes needed?
According to `Python 3.14` CI, this seems to be the last error of
`pyspark-sql` module due to the missing `pyarrow`.
- https://github.com/apache/spark/actions/workflows/build_python_3.14.yml
- https://github.com/apache/spark/actions/runs/18363201896/job/52310847550
```
======================================================================
ERROR [0.990s]: test_collect_time
(pyspark.sql.tests.test_collection.DataFrameCollectionTests.test_collect_time)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/__w/spark/spark/python/pyspark/sql/pandas/utils.py", line 69, in
require_minimum_pyarrow_version
import pyarrow
ModuleNotFoundError: No module named 'pyarrow'
```
### Does this PR introduce _any_ user-facing change?
No, this is a test case change.
### How was this patch tested?
Manual review.
### Was this patch authored or co-authored using generative AI tooling?
No.
Closes #52555 from dongjoon-hyun/SPARK-53854.
Authored-by: Dongjoon Hyun <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
---
python/pyspark/sql/tests/test_collection.py | 4 ++++
1 file changed, 4 insertions(+)
diff --git a/python/pyspark/sql/tests/test_collection.py
b/python/pyspark/sql/tests/test_collection.py
index 5c923e0e7558..7655c989efa9 100644
--- a/python/pyspark/sql/tests/test_collection.py
+++ b/python/pyspark/sql/tests/test_collection.py
@@ -365,6 +365,10 @@ class DataFrameCollectionTestsMixin:
break
self.assertEqual(df.take(8), result)
+ @unittest.skipIf(
+ not have_pandas or not have_pyarrow,
+ pandas_requirement_message or pyarrow_requirement_message,
+ )
def test_collect_time(self):
import pandas as pd
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]