| 1 | +// Module included in the following assemblies: |
| 2 | +// |
| 3 | +// * backup_and_restore/hibernating-cluster.adoc |
| 4 | + |
| 5 | +:_mod-docs-content-type: PROCEDURE |
| 6 | +[id="hibernating-cluster-resume_{context}"] |
| 7 | += Resuming a hibernated cluster |
| 8 | + |
| 9 | +When you resume a hibernated cluster within 90 days, you might have to approve certificate signing requests (CSRs) for the nodes to become ready. |
| 10 | + |
| 11 | +It can take around 45 minutes for the cluster to resume, depending on the size of your cluster. |
| 12 | + |
| 13 | +.Prerequisites |
| 14 | + |
| 15 | +* You hibernated your cluster less than 90 days ago. |
| 16 | +* You have access to the cluster as a user with the `cluster-admin` role. |
| 17 | + |
| 18 | +.Procedure |
| 19 | + |
| 20 | +. Within 90 days of cluster hibernation, resume the cluster virtual machines: |
| 21 | ++ |
| 22 | +Use the tools native to your cluster's cloud environment to resume the cluster's virtual machines. |
| 23 | + |
| 24 | +. Wait about 5 minutes for the nodes to start. Clusters with more nodes might need more time. |
| 25 | + |
| 26 | +. Approve CSRs for the nodes: |
| 27 | + |
| 28 | +.. Check that there is a CSR for each node that is in the `NotReady` state by running the following command: |
| 29 | ++ |
| 30 | +[source,terminal] |
| 31 | +---- |
| 32 | +$ oc get csr |
| 33 | +---- |
| 34 | ++ |
| 35 | +.Example output |
| 36 | +[source,terminal] |
| 37 | +---- |
| 38 | +NAME AGE SIGNERNAME REQUESTOR REQUESTEDDURATION CONDITION |
| 39 | +csr-4dwsd 37m kubernetes.io/kube-apiserver-client system:node:ci-ln-812tb4k-72292-8bcj7-worker-c-q8mw2 24h Pending |
| 40 | +csr-4vrbr 49m kubernetes.io/kube-apiserver-client system:node:ci-ln-812tb4k-72292-8bcj7-master-1 24h Pending |
| 41 | +csr-4wk5x 51m kubernetes.io/kubelet-serving system:node:ci-ln-812tb4k-72292-8bcj7-master-1 <none> Pending |
| 42 | +csr-84vb6 51m kubernetes.io/kube-apiserver-client-kubelet system:serviceaccount:openshift-machine-config-operator:node-bootstrapper <none> Pending |
| 43 | +---- |
| 44 | + |
| 45 | +.. Approve each valid CSR by running the following command: |
| 46 | ++ |
| 47 | +[source,terminal] |
| 48 | +---- |
| 49 | +$ oc adm certificate approve <csr_name> |
| 50 | +---- |
| 51 | + |
| 52 | +.. Verify that all necessary CSRs were approved by running the following command: |
| 53 | ++ |
| 54 | +[source,terminal] |
| 55 | +---- |
| 56 | +$ oc get csr |
| 57 | +---- |
| 58 | ++ |
| 59 | +.Example output |
| 60 | +[source,terminal] |
| 61 | +---- |
| 62 | +NAME AGE SIGNERNAME REQUESTOR REQUESTEDDURATION CONDITION |
| 63 | +csr-4dwsd 37m kubernetes.io/kube-apiserver-client system:node:ci-ln-812tb4k-72292-8bcj7-worker-c-q8mw2 24h Approved,Issued |
| 64 | +csr-4vrbr 49m kubernetes.io/kube-apiserver-client system:node:ci-ln-812tb4k-72292-8bcj7-master-1 24h Approved,Issued |
| 65 | +csr-4wk5x 51m kubernetes.io/kubelet-serving system:node:ci-ln-812tb4k-72292-8bcj7-master-1 <none> Approved,Issued |
| 66 | +csr-84vb6 51m kubernetes.io/kube-apiserver-client-kubelet system:serviceaccount:openshift-machine-config-operator:node-bootstrapper <none> Approved,Issued |
| 67 | +---- |
| 68 | ++ |
| 69 | +CSRs should show `Approved,Issued` in the `CONDITION` column. |
| 70 | + |
| 71 | +. Verify that all nodes now show as ready by running the following command: |
| 72 | ++ |
| 73 | +[source,terminal] |
| 74 | +---- |
| 75 | +$ oc get nodes |
| 76 | +---- |
| 77 | ++ |
| 78 | +.Example output |
| 79 | +[source,terminal] |
| 80 | +---- |
| 81 | +NAME STATUS ROLES AGE VERSION |
| 82 | +ci-ln-812tb4k-72292-8bcj7-master-0 Ready control-plane,master 32m v1.31.3 |
| 83 | +ci-ln-812tb4k-72292-8bcj7-master-1 Ready control-plane,master 32m v1.31.3 |
| 84 | +ci-ln-812tb4k-72292-8bcj7-master-2 Ready control-plane,master 32m v1.31.3 |
| 85 | +ci-ln-812tb4k-72292-8bcj7-worker-a-zhdvk Ready worker 19m v1.31.3 |
| 86 | +ci-ln-812tb4k-72292-8bcj7-worker-b-9hrmv Ready worker 19m v1.31.3 |
| 87 | +ci-ln-812tb4k-72292-8bcj7-worker-c-q8mw2 Ready worker 19m v1.31.3 |
| 88 | +---- |
| 89 | ++ |
| 90 | +All nodes should show `Ready` in the `STATUS` column. It might take a few minutes for all nodes to become ready after approving the CSRs. |
| 91 | + |
| 92 | +. Wait for cluster Operators to restart to load the new certificates. |
| 93 | ++ |
| 94 | +This might take 5 to 10 minutes. |
| 95 | + |
| 96 | +. Verify that all cluster Operators are in a good state by running the following command: |
| 97 | ++ |
| 98 | +[source,terminal] |
| 99 | +---- |
| 100 | +$ oc get clusteroperators |
| 101 | +---- |
| 102 | ++ |
| 103 | +.Example output |
| 104 | +[source,terminal] |
| 105 | +---- |
| 106 | +NAME VERSION AVAILABLE PROGRESSING DEGRADED SINCE MESSAGE |
| 107 | +authentication 4.18.0-0 True False False 51m |
| 108 | +baremetal 4.18.0-0 True False False 72m |
| 109 | +cloud-controller-manager 4.18.0-0 True False False 75m |
| 110 | +cloud-credential 4.18.0-0 True False False 77m |
| 111 | +cluster-api 4.18.0-0 True False False 42m |
| 112 | +cluster-autoscaler 4.18.0-0 True False False 72m |
| 113 | +config-operator 4.18.0-0 True False False 72m |
| 114 | +console 4.18.0-0 True False False 55m |
| 115 | +... |
| 116 | +---- |
| 117 | ++ |
| 118 | +All cluster Operators should show `AVAILABLE`=`True`, `PROGRESSING`=`False`, and `DEGRADED`=`False`. |
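
The three verification checks in this procedure can be sketched as small shell filters. This is a minimal, unofficial sketch, not part of the product: the function names `pending_csrs`, `not_ready_nodes`, and `unhealthy_operators` are hypothetical, and each filter assumes the default table layout shown in the example outputs above (header row first, whitespace-separated columns, and a populated `VERSION` column for every cluster Operator).

```shell
# Hypothetical helpers. Each one reads the default table output of an
# `oc get` command on stdin and prints the names of rows that still need
# attention. They only parse text and never change the cluster.

# Print the names of CSRs whose CONDITION (last column) is Pending.
pending_csrs() {
  awk 'NR > 1 && $NF == "Pending" { print $1 }'
}

# Print the names of nodes whose STATUS (second column) is not exactly Ready.
not_ready_nodes() {
  awk 'NR > 1 && $2 != "Ready" { print $1 }'
}

# Print cluster Operators that do not show AVAILABLE=True,
# PROGRESSING=False, and DEGRADED=False (columns 3 to 5).
# Assumption: VERSION is populated, so the columns do not shift.
# Rows with fewer than 5 fields are skipped.
unhealthy_operators() {
  awk 'NR > 1 && NF >= 5 && !($3 == "True" && $4 == "False" && $5 == "False") { print $1 }'
}
```

For example, `oc get csr | pending_csrs` lists the CSRs that are still pending, so that you can review each one before approving it with `oc adm certificate approve <csr_name>`. Similarly, `oc get nodes | not_ready_nodes` and `oc get clusteroperators | unhealthy_operators` print nothing when every node and cluster Operator is in the expected state.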