leekeiabstraction commented on code in PR #2836:
URL: https://github.com/apache/fluss/pull/2836#discussion_r2972144070


##########
website/docs/_configs/_partial_config.mdx:
##########
@@ -78,7 +78,7 @@
 | `client.lookup.batch-timeout` | `0 s` | Duration | The maximum time to wait 
for the lookup batch to full, if this timeout is reached, the lookup batch will 
be closed to send. |
 | `client.lookup.max-retries` | `2147483647` | Integer | Setting a value 
greater than zero will cause the client to resend any lookup request that fails 
with a potentially transient error. |
 | `client.scanner.remote-log.prefetch-num` | `4` | Integer | The number of 
remote log segments to keep in local temp file for LogScanner, which download 
from remote storage. The default setting is 4. |
-| `client.scanner.io.tmpdir` | 
`/var/folders/bp/v2l48kz51mx86d743qv0zhzh0000gn/T//fluss` | String | Local 
directory that is used by client for storing the data files (like kv snapshot, 
log segment files) to read temporarily |
+| `client.scanner.io.tmpdir` | 
`/var/folders/7r/lwdsh9ms4gn0fnxs8c6fcfpm0000gn/T//fluss` | String | Local 
directory that is used by client for storing the data files (like kv snapshot, 
log segment files) to read temporarily |

Review Comment:
   Looks like unrelated changes crept back into this file



##########
website/docs/_configs/_partial_config.mdx:
##########
@@ -238,9 +238,13 @@
 
 | Key | Default | Type | Description |
 | :--- | :--- | :--- | :--- |
-| `remote.data.dir` | `none` | String | The directory used for storing the kv 
snapshot data files and remote log for log tiered storage  in a Fluss supported 
filesystem. |
+| `remote.data.dir` | `none` | String | The directory used for storing the kv 
snapshot data files and remote log for log tiered storage in a Fluss supported 
filesystem. When upgrading to `remote.data.dirs`, please ensure this value is 
placed as the first entry in the new configuration.For new clusters, it is 
recommended to use `remote.data.dirs` instead. If `remote.data.dirs` is 
configured, this value will be ignored. |
+| `remote.data.dirs` | `[]` | ArrayList | A comma-separated list of 
directories in Fluss supported filesystems for storing the kv snapshot data 
files and remote log files of tables/partitions. If configured, when a new 
table or a new partition is created, one of the directories from this list will 
be selected according to the strategy specified by `remote.data.dirs.strategy` 
(`ROUND_ROBIN` by default). If not configured, the system uses 
`remote.data.dir` as the sole remote data directory for all data. |
+| `remote.data.dirs.strategy` | `ROUND_ROBIN` | RemoteDataDirStrategy | The 
strategy for selecting the remote data directory from `remote.data.dirs`. The 
candidate strategies are: [ROUND_ROBIN, WEIGHTED_ROUND_ROBIN], the default 
strategy is ROUND_ROBIN. ROUND_ROBIN: this strategy employs a round-robin 
approach to select one from the available remote directories. 
WEIGHTED_ROUND_ROBIN: this strategy selects one of the available remote 
directories based on the weights configured in `remote.data.dirs.weights`. |
+| `remote.data.dirs.weights` | `[]` | ArrayList | The weights of the remote 
data directories. This is a list of weights corresponding to the 
`remote.data.dirs` in the same order. When `remote.data.dirs.strategy` is set 
to `WEIGHTED_ROUND_ROBIN`, this must be configured, and its size must be equal 
to `remote.data.dirs`; otherwise, it will be ignored. |

Review Comment:
   ditto



##########
website/docs/_configs/_partial_config.mdx:
##########
@@ -238,9 +238,13 @@
 
 | Key | Default | Type | Description |
 | :--- | :--- | :--- | :--- |
-| `remote.data.dir` | `none` | String | The directory used for storing the kv 
snapshot data files and remote log for log tiered storage  in a Fluss supported 
filesystem. |
+| `remote.data.dir` | `none` | String | The directory used for storing the kv 
snapshot data files and remote log for log tiered storage in a Fluss supported 
filesystem. When upgrading to `remote.data.dirs`, please ensure this value is 
placed as the first entry in the new configuration.For new clusters, it is 
recommended to use `remote.data.dirs` instead. If `remote.data.dirs` is 
configured, this value will be ignored. |
+| `remote.data.dirs` | `[]` | ArrayList | A comma-separated list of 
directories in Fluss supported filesystems for storing the kv snapshot data 
files and remote log files of tables/partitions. If configured, when a new 
table or a new partition is created, one of the directories from this list will 
be selected according to the strategy specified by `remote.data.dirs.strategy` 
(`ROUND_ROBIN` by default). If not configured, the system uses 
`remote.data.dir` as the sole remote data directory for all data. |
+| `remote.data.dirs.strategy` | `ROUND_ROBIN` | RemoteDataDirStrategy | The 
strategy for selecting the remote data directory from `remote.data.dirs`. The 
candidate strategies are: [ROUND_ROBIN, WEIGHTED_ROUND_ROBIN], the default 
strategy is ROUND_ROBIN. ROUND_ROBIN: this strategy employs a round-robin 
approach to select one from the available remote directories. 
WEIGHTED_ROUND_ROBIN: this strategy selects one of the available remote 
directories based on the weights configured in `remote.data.dirs.weights`. |
+| `remote.data.dirs.weights` | `[]` | ArrayList | The weights of the remote 
data directories. This is a list of weights corresponding to the 
`remote.data.dirs` in the same order. When `remote.data.dirs.strategy` is set 
to `WEIGHTED_ROUND_ROBIN`, this must be configured, and its size must be equal 
to `remote.data.dirs`; otherwise, it will be ignored. |
 | `remote.fs.write-buffer-size` | `4 kb` | MemorySize | The default size of 
the write buffer for writing the local files to remote file systems. |
 | `remote.log.task-interval-duration` | `1 min` | Duration | Interval at which 
remote log manager runs the scheduled tasks like copy segments, clean up remote 
log segments, delete local log segments etc. If the value is set to 0, it means 
that the remote log storage is disabled. |
+| `remote.log.task-max-upload-segments` | `5` | Integer | The maximum number 
of log segments to upload to remote storage per tiering task execution. This 
limits the upload batch size to prevent overwhelming the remote storage when 
there is a large backlog of segments to upload. |

Review Comment:
   ditto



##########
website/docs/_configs/_partial_config.mdx:
##########
@@ -284,7 +288,7 @@
 | `table.auto-partition.enabled` | `false` | Boolean | Whether enable auto 
partition for the table. Disable by default. When auto partition is enabled, 
the partitions of the table will be created automatically. |
 | `table.auto-partition.key` | `none` | String | This configuration defines 
the time-based partition key to be used for auto-partitioning when a table is 
partitioned with multiple keys. Auto-partitioning utilizes a time-based 
partition key to handle partitions automatically, including creating new ones 
and removing outdated ones, by comparing the time value of the partition with 
the current system time. In the case of a table using multiple partition keys 
(such as a composite partitioning strategy), this feature determines which key 
should serve as the primary time dimension for making auto-partitioning 
decisions.And If the table has only one partition key, this config is not 
necessary. Otherwise, it must be specified. |
 | `table.auto-partition.time-unit` | `DAY` | AutoPartitionTimeUnit | The time 
granularity for auto created partitions. The default value is `DAY`. Valid 
values are `HOUR`, `DAY`, `MONTH`, `QUARTER`, `YEAR`. If the value is `HOUR`, 
the partition format for auto created is yyyyMMddHH. If the value is `DAY`, the 
partition format for auto created is yyyyMMdd. If the value is `MONTH`, the 
partition format for auto created is yyyyMM. If the value is `QUARTER`, the 
partition format for auto created is yyyyQ. If the value is `YEAR`, the 
partition format for auto created is yyyy. |
-| `table.auto-partition.time-zone` | `Europe/Paris` | String | The time zone 
for auto partitions, which is by default the same as the system time zone. |
+| `table.auto-partition.time-zone` | `Asia/Shanghai` | String | The time zone 
for auto partitions, which is by default the same as the system time zone. |

Review Comment:
   ditto



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to