nlgranger opened a new issue, #41961:
URL: https://github.com/apache/arrow/issues/41961
### Describe the enhancement requested
`pyarrow.parquet.write_table` has a compression argument which can take a
dict with column names as keys.
When the column type is a compound type, the user must actually add a suffix
to the name (ex: ".list.element"), otherwise it does not work. This is not
documented anywhere and was clearly not anticipated in the API.
An example of such a column type is:
```
optional group field_id=-1 [LiDARPoseComponent].range_image_return1.shape
(List) {
repeated group field_id=-1 list {
optional int32 field_id=-1 item;
}
}
```
I suspect the same issue arose for the `sorting_columns` arguments, so a
possible fix is to also require column indices instead of names for
`compression`.
### Component(s)
Parquet, Python
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]