sfc-gh-aixu opened a new issue, #10392:
URL: https://github.com/apache/iceberg/issues/10392

   ### Proposed Change
   
   We propose adding a Variant type to Iceberg's data types.
   
   A Variant data type provides an efficient binary encoding for dynamic 
semi-structured data such as JSON, Avro, and Parquet. Encoding 
semi-structured data as a Variant column retains the flexibility of the 
source data while allowing query engines to operate on it more 
efficiently.
   
   With Variant support, such data can be stored internally in an efficient 
binary representation for better performance. Without it, the data has to be 
kept in its source format and parsed from that format when it is read, which 
is inefficient.
   
   This will allow the following use cases:
   
   - Create an Iceberg table with a Variant column
   `CREATE OR REPLACE TABLE car_sales(record Variant);`
   - Insert semi-structured data into the Variant column
   `INSERT INTO car_sales SELECT PARSE_JSON(<json_string>);`
   - Query against the semi-structured data
   `SELECT VARIANT_GET(record, '$.dealer.ship', 'string') FROM car_sales;`
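
To make the intent of the query in the last use case concrete, here is a minimal Python sketch of what a path lookup such as `VARIANT_GET(record, '$.dealer.ship', 'string')` might do conceptually. It is not Iceberg's or any engine's implementation. The helper name `variant_get`, the sample record, and the choice to return `None` for a missing path are all assumptions made for illustration, and `json.loads` merely stands in for `PARSE_JSON`.

```python
import json


def variant_get(record, path, as_type=str):
    """Hypothetical helper: follow a '$.a.b' style path through a parsed record.

    Returns None when any step of the path is missing (an assumed,
    NULL-like behaviour), otherwise the value cast with `as_type`.
    """
    if not path.startswith("$"):
        raise ValueError("path must start with '$'")
    value = record
    # Drop the leading '$' and walk each dotted field name in turn.
    for key in filter(None, path[1:].split(".")):
        if not isinstance(value, dict) or key not in value:
            return None
        value = value[key]
    return as_type(value)


# Stand-in for PARSE_JSON: decode the text once, up front.
# The sample data below is invented for this example.
record = json.loads('{"dealer": {"ship": "ground", "id": 7}, "price": 19999}')

print(variant_get(record, "$.dealer.ship"))     # ground
print(variant_get(record, "$.dealer.missing"))  # None
```

The point of the sketch is only that the text is decoded once and later lookups navigate the decoded structure; a real Variant encoding would use a compact binary layout rather than Python dictionaries.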
   
   
   
   
   ### Proposal document
   
https://docs.google.com/document/d/1sq70XDiWJ2DemWyA5dVB80gKzwi0CWoM0LOWM7VJVd8/edit?tab=t.0
   
   ### Specifications
   
   - [X] Table
   - [ ] View
   - [ ] REST
   - [ ] Puffin
   - [ ] Encryption
   - [ ] Other


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
For additional commands, e-mail: issues-h...@iceberg.apache.org
