(ozone-site) branch HDDS-9225-website-v2 updated: HDDS-14326. [Website v2] [Docs] [Administrator Guide] Update "Configuring Network Topology". (#226)

weichiu Fri, 16 Jan 2026 10:59:54 -0800

This is an automated email from the ASF dual-hosted git repository.

weichiu pushed a commit to branch HDDS-9225-website-v2
in repository https://gitbox.apache.org/repos/asf/ozone-site.git



The following commit(s) were added to refs/heads/HDDS-9225-website-v2 by this push:
     new 1bdbb454c HDDS-14326. [Website v2] [Docs] [Administrator Guide] Update "Configuring Network Topology". (#226)
1bdbb454c is described below

commit 1bdbb454cc372f4ee29c91d6e4a3aaaf7b1f766e
Author: Wei-Chiu Chuang <[email protected]>
AuthorDate: Fri Jan 16 10:59:40 2026 -0800

    HDDS-14326. [Website v2] [Docs] [Administrator Guide] Update "Configuring Network Topology". (#226)
---
 .../02-configuration/04-performance/02-topology.md | 43 +++++++++++++---------
 1 file changed, 25 insertions(+), 18 deletions(-)

diff --git a/docs/05-administrator-guide/02-configuration/04-performance/02-topology.md b/docs/05-administrator-guide/02-configuration/04-performance/02-topology.md
index 3b5fec5e6..4111053d0 100644
--- a/docs/05-administrator-guide/02-configuration/04-performance/02-topology.md
+++ b/docs/05-administrator-guide/02-configuration/04-performance/02-topology.md
@@ -8,7 +8,8 @@ Apache Ozone uses topology information (e.g., rack placement) to optimize data a
 
 1. Configured network topology.
 2. Topology-aware Datanode selection for container replica placement (write path).
-3. Prioritized reads from topologically closest Datanodes (read path).
+3. Pipeline choosing policy to load balance write traffic.
+4. Prioritized reads from topologically closest Datanodes (read path).
 
 ## Applicability to Container Types
 
@@ -23,11 +24,15 @@ See the [page about Containers](../../../core-concepts/replication/storage-conta
 
 Ozone determines Datanode network locations (e.g., racks) using Hadoop's rack awareness, configured via `net.topology.node.switch.mapping.impl` in `ozone-site.xml`. This key specifies a `org.apache.hadoop.net.CachedDNSToSwitchMapping` implementation. [1]
 
+:::note
+Both Ozone Manager (OM) and Storage Container Manager (SCM) use network topology information. It is critical to maintain a consistent topology assignment across all OM and SCM instances in the cluster.
+:::
+
 Two primary methods exist:
 
 ### 1. Static List: `TableMapping`
 
-Maps IPs/hostnames to racks using a predefined file.
+This is the non-default implementation for static mapping. It maps IPs/hostnames to racks using a predefined file. If no mapping file is specified, or if a Datanode is not found in the file, it is assigned to `/default-rack`.
 
 - **Configuration:** Set `net.topology.node.switch.mapping.impl` to `org.apache.hadoop.net.TableMapping` and `net.topology.table.file.name` to the mapping file's path. [1]
 
@@ -42,7 +47,7 @@ Maps IPs/hostnames to racks using a predefined file.
   </property>
   ```
 
-- **File Format:** A two-column text file (IP/hostname, rack path per line). Unlisted nodes go to `/default-rack`. [1]
+- **File Format:** A two-column text file (IP/hostname, rack path per line). [1]
   Example `topology.map`:
 
   ```text
@@ -52,9 +57,9 @@ Maps IPs/hostnames to racks using a predefined file.
   datanode103.example.com /rack2
   ```
 
-### 2. Dynamic List: `ScriptBasedMapping`
+### 2. Dynamic List: `ScriptBasedMapping` (Default)
 
-Uses an external script to resolve rack locations for IPs.
+This is the default implementation. It uses an external script to resolve rack locations for IPs.
 
 - **Configuration:** Set `net.topology.node.switch.mapping.impl` to `org.apache.hadoop.net.ScriptBasedMapping` and `net.topology.script.file.name` to the script's path. [1]
 
@@ -117,7 +122,7 @@ The policy is configured by the `ozone.scm.pipeline.placement.impl` property in
 
 After a pool of healthy, open, and rack-aware pipelines has been created, this policy is used to **select one** of them to handle a client's write request. Its purpose is **load balancing**, not topology awareness, as the topology has already been handled during pipeline creation.
 
-The policy is configured by `hdds.scm.pipeline.choose.policy.impl` in `ozone-site.xml`.
+The policy is configured by `hdds.scm.pipeline.choose.policy.impl` in `ozone-site.xml` for Ratis containers, and `hdds.scm.ec.pipeline.choose.policy.impl` for EC containers. All of the following policies are supported by both Ratis and EC containers.
 
 - **`RandomPipelineChoosePolicy` (Default):** Selects a pipeline at random from the available list. This policy is simple and distributes load without considering other metrics.
 - **`CapacityPipelineChoosePolicy`:** Picks two random pipelines and selects the one with lower utilization, favoring pipelines with more available capacity.
@@ -128,7 +133,9 @@ Note: When configuring these values, include the full class name prefix: for exa
 
 ### 3. Closed Container Replication Policy
 
-This is configured using the `ozone.scm.container.placement.impl` property in `ozone-site.xml`. The available policies are:
+#### Ratis containers
+
+This is configured using the `ozone.scm.container.placement.impl` property in `ozone-site.xml` for Ratis containers. The available policies are:
 
 - **`SCMContainerPlacementRackAware` (Default)**
 
@@ -148,15 +155,15 @@ This is configured using the `ozone.scm.container.placement.impl` property in `o
 
 Note: When configuring these values, include the full class name prefix: for example, `org.apache.hadoop.hdds.scm.container.placement.algorithms.SCMContainerPlacementCapacity`
 
-## Container Placement for Erasure Coded (EC) Containers
+#### EC containers
 
 For Erasure Coded (EC) containers, SCM employs a specialized placement policy to ensure data resilience and availability by distributing data and parity blocks across multiple racks. This is configured using the `ozone.scm.container.placement.ec.impl.key` property in `ozone-site.xml`.
 
-### 1. `SCMContainerPlacementRackScatter` (Default)
+- **`SCMContainerPlacementRackScatter` (Default)**
 
-- **Function:** This is the default policy for EC containers. It attempts to place each block (both data and parity) of an EC container on a different rack. For example, for an RS-6-3-1024k container (6 data blocks + 3 parity blocks), this policy will try to place the 9 blocks on 9 different racks. This "scatter" approach maximizes the fault tolerance, as the loss of a single rack will not impact more than one block of the container. [5]
-- **Use Cases:** This policy is highly recommended for production clusters using Erasure Coding to protect against rack-level failures.
-- **Configuration:**
+  - **Function:** This is the default policy for EC containers. It attempts to place each block (both data and parity) of an EC container on a different rack. For example, for an RS-6-3-1024k container (6 data blocks + 3 parity blocks), this policy will try to place the 9 blocks on 9 different racks. This "scatter" approach maximizes the fault tolerance, as the loss of a single rack will not impact more than one block of the container. [5]
+  - **Use Cases:** This policy is highly recommended for production clusters using Erasure Coding to protect against rack-level failures.
+  - **Configuration:**
 
     ```xml
     <property>
@@ -165,22 +172,22 @@ For Erasure Coded (EC) containers, SCM employs a specialized placement policy to
     </property>
     ```
 
-- **Behavior:** If the number of available racks is less than the number of blocks in the EC group, the policy will start placing more than one block on the same rack, while trying to keep the distribution as even as possible.
-- **Limitations:** Similar to `SCMContainerPlacementRackAware`, this policy is designed for single-layer rack topologies (e.g., `/rack/node`) and is not recommended for multi-layer hierarchies.
+  - **Behavior:** If the number of available racks is less than the number of blocks in the EC group, the policy will start placing more than one block on the same rack, while trying to keep the distribution as even as possible.
+  - **Limitations:** Similar to `SCMContainerPlacementRackAware`, this policy is designed for single-layer rack topologies (e.g., `/rack/node`) and is not recommended for multi-layer hierarchies.
 
 ## Optimizing Read Paths
 
-Enable by setting `ozone.network.topology.aware.read` to `true` in `ozone-site.xml`. [1]
+This feature, enabled by default since Ozone 1.4.0 ([HDDS-8300](https://issues.apache.org/jira/browse/HDDS-8300)), directs clients to read from the topologically closest Datanodes for replicated data, reducing latency and cross-rack traffic. It is recommended to keep this enabled when you have an accurate topology configuration.
+
+If you need to disable it, set `ozone.network.topology.aware.read` to `false` in `ozone-site.xml`:
 
 ```xml
 <property>
   <name>ozone.network.topology.aware.read</name>
-  <value>true</value>
+  <value>false</value>
 </property>
 ```
 
-This directs clients (replicated data) to read from topologically closest Datanodes, reducing latency and cross-rack traffic. Recommended with accurate topology.
-
 ## Summary of Best Practices
 
 - **Accurate Topology:** Maintain an accurate, up-to-date topology map (static or dynamic script); this is foundational.
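
Editorial illustration (not part of the commit): the `TableMapping` behavior described in the diff, a two-column file with a fallback to `/default-rack` for unlisted nodes, can be sketched roughly as below. This is a standalone Python sketch, not Hadoop's or Ozone's implementation. The function names `parse_topology_map` and `resolve_rack` are hypothetical, the sample entries other than `datanode103.example.com /rack2` are made up, and skipping lines that do not have exactly two columns is an assumption of this sketch.

```python
DEFAULT_RACK = "/default-rack"


def parse_topology_map(text):
    """Parse two-column lines (IP or hostname, rack path) into a dict."""
    mapping = {}
    for line in text.splitlines():
        columns = line.split()
        # Assumption of this sketch: ignore blank or malformed lines.
        if len(columns) != 2:
            continue
        host, rack = columns
        mapping[host] = rack
    return mapping


def resolve_rack(mapping, host):
    """Return the rack for a host, falling back to the default rack."""
    return mapping.get(host, DEFAULT_RACK)


# Hypothetical sample file content in the documented two-column format.
SAMPLE_MAP = """\
192.168.1.100 /rack1
datanode101.example.com /rack1
datanode103.example.com /rack2
"""

if __name__ == "__main__":
    mapping = parse_topology_map(SAMPLE_MAP)
    for host in ("192.168.1.100", "datanode103.example.com", "unlisted.example.com"):
        print(host, resolve_rack(mapping, host))
```

A check like this may help catch hosts that would silently land in `/default-rack` before the same mapping file is distributed to every OM and SCM instance.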


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
