qzyu999 commented on issue #929:
URL: https://github.com/apache/iceberg-go/issues/929#issuecomment-4636350549

   Hi all, just a heads-up for anyone working on this implementation.
   
   I recently discovered a critical decoding bug in `arrow-go`'s 
`parquet/variant/utils.go`: `valuesize()` checks bit 4 instead of bit 2 for the 
array `is_large` flag, which silently corrupts data when reading large Variant 
arrays (more than 255 elements).
   
   I've submitted the fix over on the Arrow side:
   - Issue: https://github.com/apache/arrow-go/issues/839
   - PR: https://github.com/apache/arrow-go/pull/840
   
   Once the Arrow maintainers merge and tag a patch release, `iceberg-go` will 
want to bump its `go.mod` dependency to ensure that Iceberg v3 Variant array 
reading doesn't suffer from data corruption.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

