amol- opened a new issue, #41692:
URL: https://github.com/apache/arrow/issues/41692

   ### Describe the enhancement requested
   
   Substrait ExtendedExpressions can be used with PyArrow by converting them to 
`compute.Expression`.
   That way they can be used in the context of projections and filters when 
interacting with a Dataset or Table.
   
   Some pieces are still missing for those to be used in practice and the user 
experience is generally too complex to be convenient. This issue is meant to 
track work that can be done to improve the Substrait experience in PyArrow
   
   - Allow accepting Substrait Message objects directly instead of 
bytes(currently `pc.Expression.from_substrait(projection.SerializeToString())` 
dance is required which is not very convenient)
   - Allow accepting substrait messages directly where a `pc.Expression` is 
accepted instead of having to build the expression from the message
   - Have a way to encode PyArrow schemas to Substrait `NamedStruct`
   - Accept projections as a single substrait `ExtendedExpression` instead of 
having to build multiple different expressions for each projected column.
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to