rok opened a new issue, #47288:
URL: https://github.com/apache/arrow/issues/47288

   ### Describe the enhancement requested
   
   In the discussion about [making PyArrow tests NumPy independent](https://github.com/apache/arrow/issues/47255) it came up that we could add more random array generation kernels to produce random test data for PyArrow (see [RandomArrayGenerator](https://github.com/apache/arrow/blob/main/cpp/src/arrow/testing/random.h) for the Arrow C++ test data generator). Other libraries that use PyArrow could use such kernels too (e.g. pandas uses NumPy).
   
   We have a kernel for producing [random float64s](https://arrow.apache.org/docs/cpp/compute.html#random-number-generation) ([PR](https://github.com/apache/arrow/pull/11864)), but lack kernels for other types.
   
   API-wise, we could either expand [RandomOptions](https://arrow.apache.org/docs/cpp/api/compute.html#_CPPv4N5arrow7compute13RandomOptionsE) with parameters for different types and keep using the `pyarrow.compute.random` kernel, or provide new kernels, e.g. `pyarrow.compute.random_string`.
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
