ianmcook opened a new issue, #40581:
URL: https://github.com/apache/arrow/issues/40581

   ### Describe the usage question you have. Please include as many useful 
details as  possible.
   
   
   [Chunked transfer 
encoding](https://en.wikipedia.org/wiki/Chunked_transfer_encoding) is commonly 
used in HTTP/1.1. In chunked transfer encoding, special sequences of bytes are 
used to separate the chunks and as the terminating chunk. But what happens if 
those sequences of bytes occur inside binary Arrow IPC data, for example in a 
binary array?
   
   I am almost certain (based on an understanding of how HTTP/1.1 clients work) 
that this will not cause any problems, but we should test to be fully certain.
   
   To test, we could for example use the [simple Python GET 
example](https://github.com/apache/arrow-experiments/tree/main/http/get_simple/python/client),
 replacing the schema and the definition of `GetPutData` with the following:
   ```py
   schema = pa.schema([('a', pa.binary())])
   
   def GetPutData():
       arrays = [pa.array('4\r\nWiki\r\n7\r\npedia i\r\nB\r\nn 
\r\nchunks.\r\n0\r\n\r\nabcdefg', type=pa.binary())]
       batches = [pa.record_batch(arrays, schema), pa.record_batch(arrays, 
schema)]
       return batches
   ```
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to