larry98 opened a new issue, #43187:
URL: https://github.com/apache/arrow/issues/43187

   ### Describe the enhancement requested
   
   We'd like to use Parquet predicate pushdown on `is_in` expressions, but 
`SimplifyWithGuarantee` currently doesn't support them. We implemented a proof 
of concept that sorts and deduplicates the `is_in` expression's value set, then 
has `SimplifyWithGuarantee` binary search the value set for the inequality 
bound and slice the value set accordingly. This works well, but I'm not sure 
what the correct interface for enabling this code path should be. Our current 
approach adds a new field to `SetLookupOptions` that lets the user declare 
that the value set is already sorted and deduplicated.
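
   To make the idea concrete, here is a minimal standalone sketch of the slicing step. It uses only the C++ standard library on a plain `int64_t` vector and is not the proof-of-concept code or any Arrow API: `Cmp`, `SortAndDeduplicate`, and `SliceValueSet` are hypothetical names chosen for illustration, and the guarantee is assumed to have the form `x <op> bound`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical comparison kinds for a guarantee of the form `x <op> bound`.
enum class Cmp { Less, LessEqual, Greater, GreaterEqual };

// Sort the value set and drop duplicates so it can be binary searched.
std::vector<int64_t> SortAndDeduplicate(std::vector<int64_t> values) {
  std::sort(values.begin(), values.end());
  values.erase(std::unique(values.begin(), values.end()), values.end());
  return values;
}

// Return the part of a sorted, deduplicated value set that can still match
// once `x <op> bound` is known to hold. Values outside the slice can never
// equal x, so `is_in` only needs to test against the slice.
std::vector<int64_t> SliceValueSet(const std::vector<int64_t>& sorted_values,
                                   Cmp op, int64_t bound) {
  // Precondition (assumed, checked only in debug builds): input is sorted.
  assert(std::is_sorted(sorted_values.begin(), sorted_values.end()));
  auto first = sorted_values.begin();
  auto last = sorted_values.end();
  switch (op) {
    case Cmp::Less:  // keep v < bound
      last = std::lower_bound(first, last, bound);
      break;
    case Cmp::LessEqual:  // keep v <= bound
      last = std::upper_bound(first, last, bound);
      break;
    case Cmp::Greater:  // keep v > bound
      first = std::upper_bound(first, last, bound);
      break;
    case Cmp::GreaterEqual:  // keep v >= bound
      first = std::lower_bound(first, last, bound);
      break;
  }
  return std::vector<int64_t>(first, last);
}
```

   An empty slice means the `is_in` expression can never be true under the guarantee. If the caller can declare up front that the value set is already sorted and deduplicated, the `SortAndDeduplicate` step can be skipped, which is the case the proposed option is meant to cover.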
   
   Any thoughts? I'd be happy to put up a PR if we agree on an interface.
   
   ### Component(s)
   
   C++


