Commit 5c547c2

Merge pull request #92898 from EricPonvelle/OSDOCS-9405_Worker-Node-Updates
OSDOCS-9405: Updated worker node docs to explain the short-lived nature in ROSA/HCP
2 parents 9d716ce + cfe9a3c commit 5c547c2

10 files changed: +81 -5 lines changed

modules/rosa-hcp-upgrade-options.adoc

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ You can control the impact of upgrades to your workload by controlling which par

 Upgrade only the hosted control plane:: This initiates upgrade of the hosted control plane. It does not impact your worker nodes.

-Upgrade nodes in a machine pool:: This initiates a rolling replacement of nodes in the specified machine pool, and temporarily impacts the worker nodes on that machine pool. You can also upgrade multiple machine pools concurrently.
+Upgrade nodes in a machine pool:: {product-title} machine pool upgrades are designed to fully replace each node in a machine pool during the upgrade process. This provides additional security and stability benefits over performing an in-place upgrade. Upgrading the nodes in a machine pool initiates a rolling replacement of nodes in the specified machine pool, and temporarily impacts the worker nodes on that machine pool. You can also upgrade multiple machine pools concurrently.

 [IMPORTANT]
 ====

modules/rosa-policy-failure-points.adoc

Lines changed: 4 additions & 0 deletions
@@ -11,6 +11,10 @@

 ROSA can help further protect you against many common Kubernetes issues by adding Red{nbsp}Hat site reliability engineering (SRE) support and the option to deploy a multiple availability zone cluster, but there are several ways in which a container or infrastructure can still fail. By understanding potential points of failure, you can understand risks and appropriately architect both your applications and your clusters to be as resilient as necessary at each specific level.

+ifdef::openshift-rosa,openshift-rosa-hcp[]
+include::snippets/rosa-node-lifecycle.adoc[]
+endif::openshift-rosa,openshift-rosa-hcp[]
+
 [NOTE]
 ====
 An outage can occur at several different levels of infrastructure and cluster components.

modules/rosa-sdpolicy-platform.adoc

Lines changed: 18 additions & 0 deletions
@@ -68,6 +68,24 @@ ifndef::openshift-rosa-hcp[]
 endif::openshift-rosa-hcp[]
 clusters at this time. However, custom labels are supported when creating new machine pools.

+[id="rosa-sdpolicy-node-lifecycle_{context}"]
+== Node lifecycle
+
+Worker nodes are not guaranteed longevity, and may be replaced at any time as part of the normal operation and management of OpenShift.
+
+A worker node might be replaced in the following circumstances:
+
+* Machine health checks are deployed and configured to replace any worker node with a `NotReady` status, which keeps the cluster operating smoothly.
+* AWS EC2 instances may be terminated when AWS detects irreparable failure of the underlying hardware that hosts the instance.
+ifdef::openshift-rosa[]
+* During upgrades, a new node is first provisioned to account for any loss of cluster resources during the upgrade process. Once this new node has been successfully integrated into the cluster via the previously described automated health checks, an older node is then removed from the cluster.
+endif::openshift-rosa[]
+ifdef::openshift-rosa-hcp[]
+* During upgrades, a new, upgraded node is first created and joined to the cluster. Once this new node has been successfully integrated into the cluster via the previously described automated health checks, an older node is then removed from the cluster.
+endif::openshift-rosa-hcp[]
+
+For all containerized workloads running on a Kubernetes-based system, it is best practice to configure applications to be resilient to node replacements.
+
 [id="rosa-sdpolicy-backup-policy_{context}"]
 == Cluster backup policy

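The upgrade behavior described in the Node lifecycle section above (a new node is created and health-checked before an older node is removed) can be illustrated with a minimal Python simulation. This is a sketch for illustration only: it is not part of this commit and does not show how ROSA implements upgrades. The `Node` class, the `rolling_replace` function, the node names, and the version strings are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Node:
    # Hypothetical stand-in for a worker node; not an OpenShift API object.
    name: str
    version: str
    ready: bool = True


def rolling_replace(pool, target_version, is_healthy=lambda node: node.ready):
    """Replace each out-of-date node, adding the new node before removing the old one."""
    min_size = len(pool)
    # Iterate over a snapshot so the pool can be modified inside the loop.
    for old in [n for n in pool if n.version != target_version]:
        new = Node(name=f"{old.name}-new", version=target_version)
        pool.append(new)              # the pool temporarily holds one extra node
        if not is_healthy(new):
            pool.remove(new)          # failed health check: keep the older node
            continue
        pool.remove(old)              # only now is the older node removed
        assert len(pool) >= min_size  # capacity never drops below the starting size
    return pool


pool = [Node("worker-a", "4.15"), Node("worker-b", "4.15")]
upgraded = rolling_replace(pool, "4.16")
```

With a two-node pool, the call ends with two nodes on the target version, and the pool size never falls below two along the way. Workloads still move between nodes during such a replacement, which is why the text recommends configuring applications to tolerate node replacement.
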
nodes/index.adoc

Lines changed: 13 additions & 0 deletions
@@ -19,6 +19,19 @@ ifdef::openshift-rosa-hcp[]
 In {product-title}, the control plane nodes are hosted in a Red{nbsp}Hat-owned AWS account. Red{nbsp}Hat fully manages the control plane infrastructure for you.
 endif::openshift-rosa-hcp[]

+ifdef::openshift-rosa,openshift-rosa-hcp[]
+[IMPORTANT]
+====
+Worker nodes are not guaranteed longevity, and may be replaced at any time as part of the normal operation and management of OpenShift. For more details, see
+ifdef::openshift-rosa[]
+xref:../rosa_architecture/rosa_policy_service_definition/rosa-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-service-definition[Node lifecycle].
+endif::openshift-rosa[]
+ifdef::openshift-rosa-hcp[]
+xref:../rosa_architecture/rosa_policy_service_definition/rosa-hcp-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-hcp-service-definition[Node lifecycle].
+endif::openshift-rosa-hcp[]
+====
+endif::openshift-rosa,openshift-rosa-hcp[]
+
 Having stable and healthy nodes in a cluster is fundamental to the smooth functioning of your hosted application.
 In {product-title}, you can access, manage, and monitor a node through the `Node` object representing the node.
 Using the OpenShift CLI (`oc`) or the web console, you can perform the following operations on a node.

nodes/nodes/nodes-nodes-viewing.adoc

Lines changed: 12 additions & 0 deletions
@@ -11,6 +11,10 @@ You can list all the nodes in your cluster to obtain information such as status,
 When you perform node management operations, the CLI interacts with node objects that are representations of actual node hosts.
 The master uses the information from node objects to validate nodes with health checks.

+ifdef::openshift-rosa,openshift-rosa-hcp[]
+include::snippets/rosa-node-lifecycle.adoc[]
+endif::openshift-rosa,openshift-rosa-hcp[]
+
 // The following include statements pull in the module files that comprise
 // the assembly. Include any combination of concept, procedure, or reference
 // modules required to cover the user story. You can also include other
@@ -30,3 +34,11 @@ endif::openshift-rosa,openshift-rosa-hcp,openshift-dedicated[]
 include::modules/nodes-nodes-viewing-listing-pods.adoc[leveloffset=+1]

 include::modules/nodes-nodes-viewing-memory.adoc[leveloffset=+1]
+
+.Additional resources
+ifdef::openshift-rosa[]
+* xref:../../rosa_architecture/rosa_policy_service_definition/rosa-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-service-definition[Node lifecycle].
+endif::openshift-rosa[]
+ifdef::openshift-rosa-hcp[]
+* xref:../../rosa_architecture/rosa_policy_service_definition/rosa-hcp-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-hcp-service-definition[Node lifecycle].
+endif::openshift-rosa-hcp[]

rosa_architecture/rosa_policy_service_definition/rosa-policy-understand-availability.adoc

Lines changed: 8 additions & 0 deletions
@@ -9,3 +9,11 @@ toc::[]
 Availability and disaster avoidance are extremely important aspects of any application platform. Although {product-title} (ROSA) provides many protections against failures at several levels, customer-deployed applications must be appropriately configured for high availability. To account for outages that might occur with cloud providers, additional options are available such as deploying a cluster across multiple availability zones and maintaining multiple clusters with failover mechanisms.

 include::modules/rosa-policy-failure-points.adoc[leveloffset=+1]
+
+.Additional resources
+ifdef::openshift-rosa[]
+* xref:../../rosa_architecture/rosa_policy_service_definition/rosa-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-service-definition[Node lifecycle].
+endif::openshift-rosa[]
+ifdef::openshift-rosa-hcp[]
+* xref:../../rosa_architecture/rosa_policy_service_definition/rosa-hcp-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-hcp-service-definition[Node lifecycle].
+endif::openshift-rosa-hcp[]

rosa_cluster_admin/rosa_nodes/rosa-nodes-machinepools-about.adoc

Lines changed: 7 additions & 0 deletions
@@ -29,6 +29,9 @@ A machine pool creates compute machine sets that are all clones of the same conf
 ifdef::openshift-rosa-hcp[]
 In {product-title} clusters, the hosted control plane spans three availability zones (AZ) in the installed cloud region. Each machine pool in a {product-title} cluster deploys in a single subnet within a single AZ. Each of these AZs can have only one machine pool.
 endif::openshift-rosa-hcp[]
+ifdef::openshift-rosa,openshift-rosa-hcp[]
+include::snippets/rosa-node-lifecycle.adoc[]
+endif::openshift-rosa,openshift-rosa-hcp[]

 Multiple machine pools can exist on a single cluster, and each machine pool can contain a unique node type and node size configuration.

@@ -56,6 +59,10 @@ You cannot change the machine pool node type or size. The machine pool node type
 ====
 * You can add a label to each added machine pool.

+ifdef::openshift-rosa,openshift-rosa-hcp[]
+include::snippets/rosa-node-lifecycle.adoc[]
+endif::openshift-rosa,openshift-rosa-hcp[]
+
 .Procedure

 * *Optional:* Add a label to the default machine pool after configuration by using the default machine pool labels and running the following command:

rosa_hcp/rosa-hcp-egress-lockdown-install.adoc

Lines changed: 1 addition & 1 deletion
@@ -51,7 +51,7 @@ While you may install and upgrade your clusters as you would a regular cluster,
 ====

 [id="rosa-hcp-egress-lockdown-install-creating_{context}"]
-== Creating a Virtual Private Cloud for your {hcp-title} clusters
+== Creating a Virtual Private Cloud for your {hcp-title} clusters

 You must have a Virtual Private Cloud (VPC) to create a {hcp-title} cluster. Use one of the following methods to create a VPC:

snippets/rosa-node-lifecycle.adoc

Lines changed: 10 additions & 0 deletions
@@ -0,0 +1,10 @@
+// Module included in the following assemblies:
+//
+// * nodes/index.adoc
+
+:_mod-docs-content-type: SNIPPET
+
+[IMPORTANT]
+====
+Worker node longevity is not guaranteed, and worker nodes may be replaced at any time as part of the normal operation and management of OpenShift. For more details about the node lifecycle, refer to _additional resources_.
+====

upgrading/rosa-hcp-upgrading.adoc

Lines changed: 7 additions & 3 deletions
@@ -9,12 +9,16 @@ toc::[]
 include::modules/rosa-hcp-upgrade-options.adoc[leveloffset=+1]

 .Additional resources
-ifdef::openshift-rosa-hcp[]
-* link:https://docs.openshift.com/rosa/cli_reference/rosa_cli/rosa-manage-objects-cli.html#rosa-edit-machinepool_rosa-managing-objects-cli[ROSA CLI reference: `rosa edit machinepool`]
-endif::openshift-rosa-hcp[]
 ifndef::openshift-rosa-hcp[]
 * xref:../cli_reference/rosa_cli/rosa-manage-objects-cli.adoc#rosa-edit-machinepool_rosa-managing-objects-cli[ROSA CLI reference: `rosa edit machinepool`]
 endif::openshift-rosa-hcp[]
+ifdef::openshift-rosa[]
+* xref:../rosa_architecture/rosa_policy_service_definition/rosa-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-service-definition[Node lifecycle]
+endif::openshift-rosa[]
+ifdef::openshift-rosa-hcp[]
+* link:https://docs.openshift.com/rosa/cli_reference/rosa_cli/rosa-manage-objects-cli.html#rosa-edit-machinepool_rosa-managing-objects-cli[ROSA CLI reference: `rosa edit machinepool`]
+* xref:../rosa_architecture/rosa_policy_service_definition/rosa-hcp-service-definition.adoc#rosa-sdpolicy-node-lifecycle_rosa-hcp-service-definition[Node lifecycle]
+endif::openshift-rosa-hcp[]

 //This cannot be a module if we want to use the xrefs
 [id="rosa-lifecycle-policy_{context}"]
