HDDS-11233. Ozone Storage Policy Support. #6989

xichen01 · 2024-07-25T15:08:47Z

What changes were proposed in this pull request?

Design storage policy for for Ozone. Please comment inline on the markdown document to ask questions and post feedback. Switch to Rich Diff mode for smoother reading.

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-11233

How was this patch tested?

N/A

kerneltime · 2024-07-25T18:42:06Z

cc @sodonnel

ivandika3 · 2024-07-26T01:33:06Z

hadoop-hdds/docs/content/design/storage-policy.md

+| Tier | StorageType of Pipeline | One Replication 
+Container Replicas Storage Type | Three replication
+Container Replicas Storage Type | EC
+Container Replicas Storage Type |
+| --- | --- | --- | --- | --- |
+| SSD | SSD | SSD | 3 SSD | n SSD |
+| DISK | DISK | DISK | 3 DISK | n DISK |
+| ARCHIVE | ARCHIVE | ARCHIVE | 3 ARCHIVE | n ARCHIVE |


Nit: The table does not seem to be rendered properly.

vtutrinov · 2025-05-20T07:58:56Z

hadoop-hdds/docs/content/design/storage-policy.md

+
+| Storage Policy | Storage Tier for Write | Fallback Tier for Write |
+| --- | --- | --- |
+| Hot | SSD | DISK |


Is this the final list of desired/planned storage policies? Wouldn't we like to implement policies like in HDFS - https://hadoop.apache.org/docs/r3.4.1/hadoop-project-dist/hadoop-hdfs/ArchivalStorage.html ?

Yes, we only support these three simple storage policies current, we can extend it in the future.

vtutrinov · 2025-05-20T08:03:16Z

hadoop-hdds/docs/content/design/storage-policy.md

+The relation of Storage Policy, Storage Type and Storage Tier
+
+- The storage policy is the property of key/bucket/ prefix (Managed by OM);
+- The storage tier is the property of Pipeline and Container (Managed by SCM);


Will we deal with the storage tier as an entry of the cluster topology?

What does “deal with as a cluster topology” mean?
Storage Tier is the property of Pipeline and Container, when we create the key we will select the matched Storage Tier Pipeline and Container based on the key Storage Policy.

I meant org.apache.hadoop.hdds.scm.net.NodeSchema. Will the Storage Tier (aka rack of specific storage volumes) become a part of the network topology

Storage Tier is more like the ReplicationConfig, will be a independent fields in ContainerInfo and Pipeline

vtutrinov · 2025-05-20T08:17:46Z

hadoop-hdds/docs/content/design/storage-policy.md

+## SCM Pipeline / ContainerReplica Management and Block Allocation
+
+- Pipeline needs to add tier-related fields to distinguish between different tiers of Pipelines.
+- For Pipelines tier:


The current implementation of the background pipeline creator creates a limited list of RATIS/THREE-pipelines per datanode. The design doc proposes to deal with specific pipelines for different storage policies. Is there any proposal doc on how we gonna deal with the limitation mentioned earlier? (How should we deal with the distribution of the pipelines of different types: 30% per storage tire or some other option? What if the pipelines of one of the storrage tire will not be used?)

The count limit for current Pipeline count will be extend to each storage tier, so in the extreme condition, there will be three times Pipeline in a cluster.

But in the actual situation, there may not be so many Pipelines, because a Pipeline may support multiple tiers, such as DISK and SSD, because there may be both SSD and DISK type Volume in a Datanode machine.

And the background Pipeline creator will check the current cluster's Volume type, if a cluster has only SSD Volume, the background Pipeline will create a Pipeline with only SSD Tier type.

vtutrinov · 2025-05-22T08:15:26Z

@xichen01 is there an understanding of the time frame for the functionality to be implemented? I'd start creating the JIRA tickets and implementing them

vtutrinov · 2025-05-28T11:37:04Z

@xichen01 @kerneltime @sodonnel, could you help somehow to force the review of the design doc? The feature is very needed, and I would gladly start implementation.

xichen01 added 2 commits July 25, 2024 23:06

HDDS-11233. Ozone Storage Policy Support

ca6cd65

add License

05d6f9f

xichen01 changed the title ~~HDDS-11233. Ozone Storage Policy Support~~ HDDS-11233. Ozone Storage Policy Support. Jul 25, 2024

xichen01 added documentation Improvements or additions to documentation design labels Jul 25, 2024

ivandika3 reviewed Jul 26, 2024

View reviewed changes

vtutrinov reviewed May 20, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

HDDS-11233. Ozone Storage Policy Support. #6989

HDDS-11233. Ozone Storage Policy Support. #6989

Uh oh!

xichen01 commented Jul 25, 2024

Uh oh!

kerneltime commented Jul 25, 2024

Uh oh!

ivandika3 Jul 26, 2024

Uh oh!

vtutrinov May 20, 2025

Uh oh!

xichen01 May 20, 2025

Uh oh!

vtutrinov May 20, 2025

Uh oh!

xichen01 May 20, 2025

Uh oh!

vtutrinov May 20, 2025

Uh oh!

xichen01 May 21, 2025

Uh oh!

vtutrinov May 20, 2025

Uh oh!

xichen01 May 20, 2025

Uh oh!

vtutrinov commented May 22, 2025

Uh oh!

vtutrinov commented May 28, 2025

Uh oh!

Uh oh!

HDDS-11233. Ozone Storage Policy Support. #6989

Are you sure you want to change the base?

HDDS-11233. Ozone Storage Policy Support. #6989

Uh oh!

Conversation

xichen01 commented Jul 25, 2024

What changes were proposed in this pull request?

What is the link to the Apache JIRA

How was this patch tested?

Uh oh!

kerneltime commented Jul 25, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vtutrinov commented May 22, 2025

Uh oh!

vtutrinov commented May 28, 2025

Uh oh!

Uh oh!