rdblue commented on code in PR #5432:
URL: https://github.com/apache/iceberg/pull/5432#discussion_r950908532


##########
format/gcm-stream-spec.md:
##########
@@ -0,0 +1,87 @@
+---
+title: "AES GCM Stream Spec"
+url: gcm-stream-spec
+toc: true
+disableSidebar: true
+---
+<!--
+ - Licensed to the Apache Software Foundation (ASF) under one or more
+ - contributor license agreements.  See the NOTICE file distributed with
+ - this work for additional information regarding copyright ownership.
+ - The ASF licenses this file to You under the Apache License, Version 2.0
+ - (the "License"); you may not use this file except in compliance with
+ - the License.  You may obtain a copy of the License at
+ -
+ -   http://www.apache.org/licenses/LICENSE-2.0
+ -
+ - Unless required by applicable law or agreed to in writing, software
+ - distributed under the License is distributed on an "AS IS" BASIS,
+ - WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ - See the License for the specific language governing permissions and
+ - limitations under the License.
+ -->
+
+# AES GCM Stream (AGS) file format extension
+
+## Background and Motivation
+
+Iceberg supports a number of data file formats. Two of these formats (Parquet and ORC) have built-in encryption capabilities, that allow to protect sensitive information in the data files. However, besides the data files, Iceberg tables also have metadata files, that keep sensitive information too (e.g., min/max values in manifest files, or bloom filter bitsets in puffin files). Metadata file formats (AVRO, JSON, Puffin) don't have encryption support.
+
+Moreover, with the exception of Parquet, no Iceberg data or metadata file format supports integrity verification, required for end-to-end tamper proofing of Iceberg tables.
+
+This document specifies details of a simple file format extension that adds encryption and tamper-proofing to any existing file format.

Review Comment:
   I think this should state more clearly that the proposal is to encrypt the 
entire stream, which will prevent skipping in the underlying file.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

