Dirt-Nasty opened a new issue, #744:
URL: https://github.com/apache/arrow-go/issues/744

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   I've decided to use arrow-go to add functionality to my application that 
takes a gRPC stream and writes it out to Parquet files.  I then load these 
files into Snowflake tables as part of our historical ETL processing.  Once 
the data was loaded, I noticed that some of it was misplaced: some rows held 
values that belonged to other rows, primarily in array struct columns.
   
   After beating my head against the wall debugging this, I decided to repack 
the Parquet file using Python/pyarrow.  To my surprise, the data then landed in 
the correct place upon ingestion.
   
   I really don't want to have to add this step to the processing and move 
from Go -> Python before the files land in blob storage for ingestion.
   
   I searched the issues here but didn't find anyone reporting a similar 
problem.
   
   Any insight into things I could try would be great.
   
   ### Component(s)
   
   Parquet


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
