[I] Comet throws RuntimeException instead of SparkException for invalid row index column type [datafusion-comet]

via GitHub Thu, 02 Apr 2026 14:30:48 -0700


andygrove opened a new issue, #3886:
URL: https://github.com/apache/datafusion-comet/issues/3886


   ## Description
   
   The Spark SQL test \`ParquetRowIndexSuite: invalid row index column type - 
vectorized reader\` expects a \`SparkException\` with condition 
\`FAILED_READ_FILE\` when the row index temporary column is declared with the 
wrong type (e.g., \`StringType\` instead of \`LongType\`).
   
   Spark's vectorized reader wraps the \`RuntimeException\` from 
\`findRowIndexColumnIndexInSchema\` in a \`SparkException\` via 
\`QueryExecutionErrors.cannotReadFilesError()\`. Comet's \`NativeBatchReader\` 
lets the \`RuntimeException\` propagate unwrapped.
   
   ## Steps to reproduce
   
   Run the Spark SQL test suite \`ParquetRowIndexSuite\` with Comet enabled 
against Spark 4.0.1.
   
   ## Expected behavior
   
   \`SparkException\` with condition starting with \`FAILED_READ_FILE\` should 
be thrown, matching Spark's native behavior.
   
   ## Actual behavior
   
   \`RuntimeException\` is thrown directly.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[I] Comet throws RuntimeException instead of SparkException for invalid row index column type [datafusion-comet]

Reply via email to