bitsondatadev commented on code in PR #9819: URL: https://github.com/apache/iceberg/pull/9819#discussion_r1507058032
########## docs/docs/configuration.md: ########## @@ -122,16 +122,16 @@ The value of these properties are not persisted as a part of the table metadata. Iceberg catalogs support using catalog properties to configure catalog behaviors. Here is a list of commonly used catalog properties: -| Property | Default | Description | -| --------------------------------- | ------------------ | ------------------------------------------------------ | -| catalog-impl | null | a custom `Catalog` implementation to use by an engine | -| io-impl | null | a custom `FileIO` implementation to use in a catalog | -| warehouse | null | the root path of the data warehouse | -| uri | null | a URI string, such as Hive metastore URI | -| clients | 2 | client pool size | -| cache-enabled | true | Whether to cache catalog entries | -| cache.expiration-interval-ms | 30000 | How long catalog entries are locally cached, in milliseconds; 0 disables caching, negative values disable expiration | -| metrics-reporter-impl | org.apache.iceberg.metrics.LoggingMetricsReporter | Custom `MetricsReporter` implementation to use in a catalog. See the [Metrics reporting](metrics-reporting) section for additional details | +| Property | Default | Description | +| --------------------------------- | ------------------ |-----------------------------------------------------------------------------------------------------------------------------------------------| +| catalog-impl | null | a custom `Catalog` implementation to use by an engine | +| io-impl | null | a custom `FileIO` implementation to use in a catalog | +| warehouse | null | the root path of the data warehouse | +| uri | null | a URI string, such as Hive metastore URI | +| clients | 2 | client pool size | +| cache-enabled | true | Whether to cache catalog entries | +| cache.expiration-interval-ms | 30000 | How long catalog entries are locally cached, in milliseconds; 0 disables caching, negative values disable expiration | +| metrics-reporter-impl | org.apache.iceberg.metrics.LoggingMetricsReporter | Custom `MetricsReporter` implementation to use in a catalog. See the [Metrics reporting](metrics-reporting.md) section for additional details | Review Comment: Don't forget these become relative links when you add `.md`. This navigates to <https://iceberg.apache.org/docs/latest/configuration/metrics-reporting>, not <https://iceberg.apache.org/docs/latest/metrics-reporting> ```suggestion | metrics-reporter-impl | org.apache.iceberg.metrics.LoggingMetricsReporter | Custom `MetricsReporter` implementation to use in a catalog. See the [Metrics reporting](../metrics-reporting.md) section for additional details | ``` ########## docs/docs/flink-writes.md: ########## @@ -267,8 +267,8 @@ Check out all the options here: [write-options](flink-configuration.md#write-opt ## Notes Flink streaming write jobs rely on snapshot summary to keep the last committed checkpoint ID, and -store uncommitted data as temporary files. Therefore, [expiring snapshots](../tables/maintenance#expire-snapshots) -and [deleting orphan files](../tables/maintenance#delete-orphan-files) could possibly corrupt +store uncommitted data as temporary files. Therefore, [expiring snapshots](maintenance.md#expire-snapshots) Review Comment: Links to <https://iceberg.apache.org/docs/latest/flink-writes/maintenance/#expire-snapshots> and not to <https://iceberg.apache.org/docs/latest/maintenance/#expire-snapshots>. ```suggestion store uncommitted data as temporary files. Therefore, [expiring snapshots](../maintenance.md#expire-snapshots) ``` ########## docs/docs/spark-configuration.md: ########## @@ -176,19 +176,19 @@ df.write .insertInto("catalog.db.table") ``` -| Spark option | Default | Description | -| ---------------------- | -------------------------- | ------------------------------------------------------------ | -| write-format | Table write.format.default | File format to use for this write operation; parquet, avro, or orc | -| target-file-size-bytes | As per table property | Overrides this table's write.target-file-size-bytes | -| check-nullability | true | Sets the nullable check on fields | -| snapshot-property._custom-key_ | null | Adds an entry with custom-key and corresponding value in the snapshot summary (the `snapshot-property.` prefix is only required for DSv2) | -| fanout-enabled | false | Overrides this table's write.spark.fanout.enabled | -| check-ordering | true | Checks if input schema and table schema are same | -| isolation-level | null | Desired isolation level for Dataframe overwrite operations. `null` => no checks (for idempotent writes), `serializable` => check for concurrent inserts or deletes in destination partitions, `snapshot` => checks for concurrent deletes in destination partitions. | -| validate-from-snapshot-id | null | If isolation level is set, id of base snapshot from which to check concurrent write conflicts into a table. Should be the snapshot before any reads from the table. Can be obtained via [Table API](../../api#table-metadata) or [Snapshots table](../spark-queries#snapshots). If null, the table's oldest known snapshot is used. | -| compression-codec | Table write.(fileformat).compression-codec | Overrides this table's compression codec for this write | -| compression-level | Table write.(fileformat).compression-level | Overrides this table's compression level for Parquet and Avro tables for this write | -| compression-strategy | Table write.orc.compression-strategy | Overrides this table's compression strategy for ORC tables for this write | +| Spark option | Default | Description | +| ---------------------- | -------------------------- |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| +| write-format | Table write.format.default | File format to use for this write operation; parquet, avro, or orc | +| target-file-size-bytes | As per table property | Overrides this table's write.target-file-size-bytes | +| check-nullability | true | Sets the nullable check on fields | +| snapshot-property._custom-key_ | null | Adds an entry with custom-key and corresponding value in the snapshot summary (the `snapshot-property.` prefix is only required for DSv2) | +| fanout-enabled | false | Overrides this table's write.spark.fanout.enabled | +| check-ordering | true | Checks if input schema and table schema are same | +| isolation-level | null | Desired isolation level for Dataframe overwrite operations. `null` => no checks (for idempotent writes), `serializable` => check for concurrent inserts or deletes in destination partitions, `snapshot` => checks for concurrent deletes in destination partitions. | +| validate-from-snapshot-id | null | If isolation level is set, id of base snapshot from which to check concurrent write conflicts into a table. Should be the snapshot before any reads from the table. Can be obtained via [Table API](api.md#table-metadata) or [Snapshots table](spark-queries.md#snapshots). If null, the table's oldest known snapshot is used. | Review Comment: `spark-queries is relative to spark-configuration so no need for `../` ```suggestion | validate-from-snapshot-id | null | If isolation level is set, id of base snapshot from which to check concurrent write conflicts into a table. Should be the snapshot before any reads from the table. Can be obtained via [Table API](../api.md#table-metadata) or [Snapshots table](spark-queries.md#snapshots). If null, the table's oldest known snapshot is used. | ``` ########## docs/docs/flink.md: ########## @@ -22,22 +22,22 @@ title: "Flink Getting Started" Apache Iceberg supports both [Apache Flink](https://flink.apache.org/)'s DataStream API and Table API. See the [Multi-Engine Support](../../multi-engine-support.md#apache-flink) page for the integration of Apache Flink. -| Feature support | Flink | Notes | -| ----------------------------------------------------------- |-------|----------------------------------------------------------------------------------------| -| [SQL create catalog](#creating-catalogs-and-using-catalogs) | ✔️ | | -| [SQL create database](#create-database) | ✔️ | | -| [SQL create table](#create-table) | ✔️ | | -| [SQL create table like](#create-table-like) | ✔️ | | -| [SQL alter table](#alter-table) | ✔️ | Only support altering table properties, column and partition changes are not supported | -| [SQL drop_table](#drop-table) | ✔️ | | -| [SQL select](#querying-with-sql) | ✔️ | Support both streaming and batch mode | -| [SQL insert into](#insert-into) | ✔️ ️ | Support both streaming and batch mode | -| [SQL insert overwrite](#insert-overwrite) | ✔️ ️ | | -| [DataStream read](#reading-with-datastream) | ✔️ ️ | | -| [DataStream append](#appending-data) | ✔️ ️ | | -| [DataStream overwrite](#overwrite-data) | ✔️ ️ | | -| [Metadata tables](#inspecting-tables) | ✔️ | | -| [Rewrite files action](#rewrite-files-action) | ✔️ ️ | | +| Feature support | Flink | Notes | +|----------------------------------------------------------------|-------|----------------------------------------------------------------------------------------| +| [SQL create catalog](flink-ddl.md#create-catalog) | ✔️ | | +| [SQL create database](flink-ddl.md#create-database) | ✔️ | | +| [SQL create table](flink-ddl.md#create-table) | ✔️ | | +| [SQL create table like](flink-ddl.md#create-table-like) | ✔️ | | +| [SQL alter table](flink-ddl.md#alter-table) | ✔️ | Only support altering table properties, column and partition changes are not supported | +| [SQL drop_table](flink-ddl.md#drop-table) | ✔️ | | +| [SQL select](flink-queries.md#reading-with-sql) | ✔️ | Support both streaming and batch mode | +| [SQL insert into](flink-writes.md#insert-into) | ✔️ ️ | Support both streaming and batch mode | +| [SQL insert overwrite](flink-writes.md#insert-overwrite) | ✔️ ️ | | +| [DataStream read](flink-queries.md#reading-with-datastream) | ✔️ ️ | | +| [DataStream append](flink-writes.md#appending-data) | ✔️ ️ | | +| [DataStream overwrite](flink-writes.md#overwrite-data) | ✔️ ️ | | +| [Metadata tables](flink-queries.md#inspecting-tables) | ✔️ | | +| [Rewrite files action](flink-actions.md#rewrite-files-action) | ✔️ ️ | | Review Comment: ```suggestion | [SQL create catalog](../flink-ddl.md#create-catalog) | ✔️ | | | [SQL create database](../flink-ddl.md#create-database) | ✔️ | | | [SQL create table](../flink-ddl.md#create-table) | ✔️ | | | [SQL create table like](../flink-ddl.md#create-table-like) | ✔️ | | | [SQL alter table](../flink-ddl.md#alter-table) | ✔️ | Only support altering table properties, column and partition changes are not supported | | [SQL drop_table](../flink-ddl.md#drop-table) | ✔️ | | | [SQL select](../flink-queries.md#reading-with-sql) | ✔️ | Support both streaming and batch mode | | [SQL insert into](../flink-writes.md#insert-into) | ✔️ ️ | Support both streaming and batch mode | | [SQL insert overwrite](../flink-writes.md#insert-overwrite) | ✔️ ️ | | | [DataStream read](../flink-queries.md#reading-with-datastream) | ✔️ ️ | | | [DataStream append](../flink-writes.md#appending-data) | ✔️ ️ | | | [DataStream overwrite](../flink-writes.md#overwrite-data) | ✔️ ️ | | | [Metadata tables](../flink-queries.md#inspecting-tables) | ✔️ | | | [Rewrite files action](../flink-actions.md#rewrite-files-action) | ✔️ ️ | | ``` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org For additional commands, e-mail: issues-h...@iceberg.apache.org