This is an automated email from the ASF dual-hosted git repository.

dongjoon pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/master by this push:
     new b3e6d97a06f0 [SPARK-55385][CORE][SQL][FOLLOW-UP] 
`getAncestorWithSamePartitionSizes` should stop at checkpointed ancestor
b3e6d97a06f0 is described below

commit b3e6d97a06f02df822744add3b9f95148ea0ea65
Author: Ruifeng Zheng <[email protected]>
AuthorDate: Fri Feb 13 11:52:31 2026 -0800

    [SPARK-55385][CORE][SQL][FOLLOW-UP] `getAncestorWithSamePartitionSizes` 
should stop at checkpointed ancestor
    
    ### What changes were proposed in this pull request?
    `getAncestorWithSamePartitionSizes` should stop at checkpointed ancestor
    
    ### Why are the changes needed?
    if the ancestor is already checkpointed, no matter reliably or locally, it 
should be returned
    
    ### Does this PR introduce _any_ user-facing change?
    no
    
    ### How was this patch tested?
    ci
    
    ### Was this patch authored or co-authored using generative AI tooling?
    no
    
    Closes #54311 from zhengruifeng/zip_cp.
    
    Authored-by: Ruifeng Zheng <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
---
 core/src/main/scala/org/apache/spark/rdd/ZippedWithIndexRDD.scala | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/core/src/main/scala/org/apache/spark/rdd/ZippedWithIndexRDD.scala 
b/core/src/main/scala/org/apache/spark/rdd/ZippedWithIndexRDD.scala
index 98b1f60dbbef..60638282668d 100644
--- a/core/src/main/scala/org/apache/spark/rdd/ZippedWithIndexRDD.scala
+++ b/core/src/main/scala/org/apache/spark/rdd/ZippedWithIndexRDD.scala
@@ -42,7 +42,7 @@ class ZippedWithIndexRDD[T: ClassTag](prev: RDD[T]) extends 
RDD[(T, Long)](prev)
 
   private def getAncestorWithSamePartitionSizes(rdd: RDD[_]): RDD[_] = {
     rdd match {
-      case c: RDD[_] if c.getStorageLevel != StorageLevel.NONE => c
+      case c: RDD[_] if c.isCheckpointed || c.getStorageLevel != 
StorageLevel.NONE => c
       case m: MapPartitionsRDD[_, _] if m.preservesPartitionSizes =>
         getAncestorWithSamePartitionSizes(m.prev)
       case m: MapPartitionsWithEvaluatorRDD[_, _] if m.preservesPartitionSizes 
=>


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to