[
https://issues.apache.org/jira/browse/IMPALA-11838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Csaba Ringhofer resolved IMPALA-11838.
--------------------------------------
Fix Version/s: Impala 4.3.0
Resolution: Fixed
> Relax Parquet Version Check
> ---------------------------
>
> Key: IMPALA-11838
> URL: https://issues.apache.org/jira/browse/IMPALA-11838
> Project: IMPALA
> Issue Type: Improvement
> Components: Backend
> Reporter: Micah Kornfield
> Priority: Minor
> Fix For: Impala 4.3.0
>
>
> There is currently a check that verifies the version number of [parquet files
> is equal to
> 1](https://github.com/apache/impala/blob/1e30ca228d683821e42e51f94478c77642f5331a/be/src/exec/parquet/parquet-metadata-utils.cc#L256).
> This seems potentially overly strict because the version isn't necessarily
> [super
> reliable](https://github.com/apache/arrow/blob/a580f2711750ef507cc57ce48cb431dd700a6166/cpp/src/parquet/metadata.h#L326)
> There are also many v2 files that are likely still readable even if the
> reader doesn't support any v2 features.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)