leekeiabstraction commented on code in PR #2578:
URL: https://github.com/apache/fluss/pull/2578#discussion_r2956282020


##########
fluss-lake/fluss-lake-lance/src/test/java/org/apache/fluss/lake/lance/tiering/LanceTieringITCase.java:
##########
@@ -140,6 +142,112 @@ void testTiering() throws Exception {
         jobClient.cancel().get();
     }
 
+    @Test
+    void testTieringWithNestedRowType() throws Exception {

Review Comment:
   Nit: should we test row of row as well?



##########
fluss-lake/fluss-lake-lance/src/main/java/org/apache/fluss/lake/lance/utils/ArrowDataConverter.java:
##########
@@ -94,6 +95,17 @@ private static void copyVectorData(
             }
         }
 
+        if (shadedVector

Review Comment:
   I wonder if we should be more defensive in this class. Currently if only one 
of the vector class is struct, this will be skipped and use non-struct copy 
logic, failure will appear much later. (Appreciate that this is the convention 
of the class i.e. listvector, consider this as nit)



##########
website/docs/_configs/_partial_config.mdx:
##########
@@ -284,7 +288,7 @@
 | `table.auto-partition.enabled` | `false` | Boolean | Whether enable auto 
partition for the table. Disable by default. When auto partition is enabled, 
the partitions of the table will be created automatically. |
 | `table.auto-partition.key` | `none` | String | This configuration defines 
the time-based partition key to be used for auto-partitioning when a table is 
partitioned with multiple keys. Auto-partitioning utilizes a time-based 
partition key to handle partitions automatically, including creating new ones 
and removing outdated ones, by comparing the time value of the partition with 
the current system time. In the case of a table using multiple partition keys 
(such as a composite partitioning strategy), this feature determines which key 
should serve as the primary time dimension for making auto-partitioning 
decisions.And If the table has only one partition key, this config is not 
necessary. Otherwise, it must be specified. |
 | `table.auto-partition.time-unit` | `DAY` | AutoPartitionTimeUnit | The time 
granularity for auto created partitions. The default value is `DAY`. Valid 
values are `HOUR`, `DAY`, `MONTH`, `QUARTER`, `YEAR`. If the value is `HOUR`, 
the partition format for auto created is yyyyMMddHH. If the value is `DAY`, the 
partition format for auto created is yyyyMMdd. If the value is `MONTH`, the 
partition format for auto created is yyyyMM. If the value is `QUARTER`, the 
partition format for auto created is yyyyQ. If the value is `YEAR`, the 
partition format for auto created is yyyy. |
-| `table.auto-partition.time-zone` | `Europe/Paris` | String | The time zone 
for auto partitions, which is by default the same as the system time zone. |
+| `table.auto-partition.time-zone` | `Asia/Shanghai` | String | The time zone 
for auto partitions, which is by default the same as the system time zone. |

Review Comment:
   Ditto



##########
website/docs/_configs/_partial_config.mdx:
##########
@@ -238,9 +238,13 @@
 
 | Key | Default | Type | Description |
 | :--- | :--- | :--- | :--- |
-| `remote.data.dir` | `none` | String | The directory used for storing the kv 
snapshot data files and remote log for log tiered storage  in a Fluss supported 
filesystem. |
+| `remote.data.dir` | `none` | String | The directory used for storing the kv 
snapshot data files and remote log for log tiered storage in a Fluss supported 
filesystem. When upgrading to `remote.data.dirs`, please ensure this value is 
placed as the first entry in the new configuration.For new clusters, it is 
recommended to use `remote.data.dirs` instead. If `remote.data.dirs` is 
configured, this value will be ignored. |
+| `remote.data.dirs` | `[]` | ArrayList | A comma-separated list of 
directories in Fluss supported filesystems for storing the kv snapshot data 
files and remote log files of tables/partitions. If configured, when a new 
table or a new partition is created, one of the directories from this list will 
be selected according to the strategy specified by `remote.data.dirs.strategy` 
(`ROUND_ROBIN` by default). If not configured, the system uses 
`remote.data.dir` as the sole remote data directory for all data. |

Review Comment:
   Are these part of the lance row support change?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to