FANNG1 commented on code in PR #10203: URL: https://github.com/apache/gravitino/pull/10203#discussion_r2905224017
########## docs/table-maintenance-service/optimizer.md: ########## @@ -0,0 +1,115 @@ +--- +title: "Table Maintenance Service (Optimizer)" +slug: /table-maintenance-service +keyword: table maintenance, optimizer, statistics, metrics, monitor +license: This software is licensed under the Apache License version 2. +--- + +## What is this service + +The Table Maintenance Service (Optimizer) automates table maintenance by connecting: + +- Statistics and metrics collection +- Rule evaluation and strategy recommendation +- Job template based execution + +The CLI commands and configuration keys use the `optimizer` name. + +## Architecture overview + +The optimizer workflow is based on six parts: + +1. Metadata objects: catalog/schema/table in a metalake. +2. Statistics and metrics: table/partition signals used for decision making. +3. Policies: strategy intent, for example `system_iceberg_compaction`. +4. Job templates: executable contracts, for example built-in Spark templates. +5. Job executor: local or custom backend that runs submitted jobs. +6. Status and logs: REST job state plus local staging logs. Review Comment: will add graph latter -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
