Merge pull request #13610 from kalexand-rh/autoscaler

kalexand-rh · web-flow · commit 37f704fa8b93 · 2019-02-14T08:48:31.000-05:00
draft of autoscaler assembly
diff --git a/_topic_map.yml b/_topic_map.yml
@@ -191,6 +191,13 @@ Topics:
 - Name: Pruning objects
   File: pruning-objects
 ---
+Name: Control Plane management
+Dir: control-plane-management
+Distros: openshift-origin, openshift-enterprise
+Topics:
+- Name: Applying autoscaling to a cluster
+  File: applying-autoscaling
+---
 Name: Networking
 Dir: networking
 Distros: openshift-*
diff --git a/control-plane-management/applying-autoscaling.adoc b/control-plane-management/applying-autoscaling.adoc
@@ -0,0 +1,49 @@
+[id='applying-autoscaling']
+= Applying autoscaling to a {product-title} cluster
+include::modules/common-attributes.adoc[]
+:context: pplying-autoscaling
+
+toc::[]
+
+Applying autoscaling to a {product-title} cluster involves deploying a
+ClusterAutoscaler and then deploying MachineAutoscalers for each Machine type
+in your cluster.
+
+include::modules/cluster-autoscaler-about.adoc[leveloffset=+1]
+
+include::modules/machine-autoscaler-about.adoc[leveloffset=+1]
+
+[id='configuring-clusterautoscaler']
+= Configuring the ClusterAutoscaler
+
+First, deploy the ClusterAutoscaler to manage automatic resource scaling in
+your {product-title} cluster.
+
+include::modules/cluster-autoscaler-crd.adoc[leveloffset=+2]
+
+:FeatureName: ClusterAutoscaler
+include::modules/deploying-resource.adoc[leveloffset=+2]
+
+[id='configuring-machineautoscaler']
+= Configuring the MachineAutoscalers
+
+After you deploy the ClusterAutoscaler, you can
+deploy MachineAutoscaler resources for each of the machine types in your
+cluster to manage deployments of individual machines.
+
+[NOTE]
+====
+You must configure separate resources for each MachineSet that you want to
+autoscale.
+====
+
+include::modules/machine-autoscaler-crd.adoc[leveloffset=+2]
+
+:FeatureName: MachineAutoscaler
+include::modules/deploying-resource.adoc[leveloffset=+2]
+
+= Additional resources
+
+* For more information about pod priority, see
+xref:../nodes/nodes-pods-priority.adoc#nodes-pods-priority[Including pod priority in pod scheduling decisions in {product-title}].
+
diff --git a/control-plane-management/images b/control-plane-management/images
@@ -0,0 +1 @@
+../images
diff --git a/control-plane-management/modules b/control-plane-management/modules
@@ -0,0 +1 @@
+../modules
diff --git a/modules/cluster-autoscaler-about.adoc b/modules/cluster-autoscaler-about.adoc
@@ -0,0 +1,82 @@
+// Module included in the following assemblies:
+//
+// * control-plane-management/applying-autoscaling.adoc
+
+[id='cluster-autoscaler-about-{context}']
+= About the ClusterAutoscaler
+
+The ClusterAutoscaler adjusts the size of an {product-title} cluster to meet
+its current deployment needs. It uses declarative, Kubernetes-style arguments to
+provide infrastructure management that does not rely on objects of a specific
+cloud provider.
+
+The ClusterAutoscaler increases the size of the cluster when there are pods
+that failed to schedule on any of the current nodes due to insufficient
+resources or when another node is necessary to meet deployment needs. The
+ClusterAutoscaler does not increase the cluster resources beyond the limits
+that you specify.
+
+The ClusterAutoscaler decreases the size of the cluster when some nodes are
+consistently not needed for a significant period, such as when it has low
+resource use and all of its important pods can fit on other nodes.
+
+If the following types of pods are present on a node, the ClusterAutoscaler
+will not remove the node:
+
+* Pods with restrictive PodDisruptionBudgets (PDBs).
+* Kube-system pods that do not run on the node by default.
+* Kube-system pods that do not have a PDBB or have a PDB that is too restrictive.
+* Pods that are not backed by a controller object such as a Deployment,
+ReplicaSet, or StatefulSet.
+* Pods with local storage.
+* Pods that cannot be moved elsewhere because of a lack of resources,
+incompatible node selectors or affinity, matching anti-affinity, and so on.
+* Unless they also have a `"cluster-autoscaler.kubernetes.io/safe-to-evict": "true"`
+annotation, pods that have a `"cluster-autoscaler.kubernetes.io/safe-to-evict": "false"`
+annotation.
+
+If you configure the ClusterAutoscaler, additional usage restrictions apply:
+
+* Do not modify the nodes that are in autoscaled node groups directly. All nodes
+within the same node group have the same capacity and labels and run the same
+system pods.
+* Specify requests for your pods.
+* If you need to prevent pods from being deleted too quickly, configure
+appropriate PDBs.
+* Confirm that your cloud provider quota is large enough to support the
+maximum node pools that you configure.
+* Do not run additional node group autoscalers, especially the ones offered by
+your cloud provider.
+
+
+The Horizontal Pod Autoscaler (HPA) and the ClusterAutoscaler modify cluster
+resources in different ways. The HPA changes the deployment's or ReplicaSet's
+number of replicas based on the current CPU load.
+If the load increases, the HPA creates new replicas, regardless of the amount
+of resources available to the cluster.
+If there are not enough resources, the ClusterAutoscaler adds resources so that
+the HPA-created pods can run.
+If the load decreases, the HPA stops some replicas. If this action causes some
+nodes to be underutilized or completely empty, the ClusterAutoscaler deletes
+the unnecessary nodes.
+
+
+The ClusterAutoscaler takes pod priorities into account. The Pod Priority and
+Preemption feature enables scheduling pods based on priorities if the cluster
+does not have enough resources, but the ClusterAutoscaler ensures that the
+cluster has resources to run all pods. To honor the intention of both features,
+the ClusterAutoscaler inclues a priority cutoff. You can use this cutoff to
+schedule "best-effort" pods, which do not cause the ClusterAutoscaler to
+increase resources but instead run only when spare resources are available.
+
+Pods with priority lower than the cutoff value do not cause the cluster to scale
+up or prevent the cluster from scaling down. No new nodes are added to run the
+pods, and nodes running these pods might be deleted to free resources.
+
+////
+Default priority cutoff is 0. It can be changed using `--expendable-pods-priority-cutoff` flag,
+but we discourage it.
+ClusterAutoscaler also doesn't trigger scale-up if an unschedulable pod is already waiting for a lower
+priority pod preemption.
+////
+
diff --git a/modules/cluster-autoscaler-crd.adoc b/modules/cluster-autoscaler-crd.adoc
@@ -0,0 +1,60 @@
+// Module included in the following assemblies:
+//
+// * control-plane-management/applying-autoscaling.adoc
+
+[id='cluster-autoscaler-crd-{context}']
+= ClusterAutoscaler resource definition
+
+This `ClusterAutoscaler` resource definition shows the parameters and sample
+values for the ClusterAutoscaler.
+
+
+[source,yaml]
+----
+apiVersion: "autoscaling.openshift.io/v1alpha1"
+kind: "ClusterAutoscaler"
+metadata:
+  name: "default"
+spec:
+  podPriorityThreshold: -10 <1>
+  resourceLimits:
+    maxNodesTotal: 24 <2>
+    cores:
+      min: 8 <3>
+      max: 128 <4>
+    memory:
+      min: 4 <5>
+      max: 256 <6>
+    gpus:
+      - type: nvidia.com/gpu <7>
+        min: 0 <8>
+        max: 16 <9>
+      - type: amd.com/gpu <7>
+        min: 0 <8>
+        max: 4 <9>
+  scaleDown:
+    enabled: true <10>
+    delayAfterAdd: 10s <11>
+    delayAfterDelete: 10s <12>
+    delayAfterFailure: 10s <13>
+    unneededTime: 10s <14>
+----
+<1> Specify the priority that a pod must exceed to cause the ClusterAutoscaler
+to deploy additional nodes. Enter a 32-bit integer value. The
+`podPriorityThreshold` value is compared to the value of the `PriorityClass` that
+you assign to each pod.
+<2> Specify the maximum number of nodes to deploy.
+<3> Specify the minimum number of cores to deploy.
+<4> Specify the maximum number of cores to deploy.
+<5> Specify the minimum amount of memory, in GiB, per node.
+<6> Specify the maximum amount of memory, in GiB, per node.
+<7> Specify the type of GPU node to deploy. Only `nvidia.com/gpu` and `amd.com/gpu`
+are valid types.
+<8> Specify the minimum number of GPU cores to deploy.
+<9> Specify the maxiumum number of GPU cores to deploy.
+<10> Specify whether the ClusterAutoscaler can remove unnecessary nodes.
+<11> Specify the period, in seconds, to wait before deploying another node.
+<12> Specify the period, in seconds, to wait before deleting another node.
+<13> Specify the period, in seconds, to wait to deploy another node if the
+current deployment fails.
+<14> Specify the period, in seconds, before an unnecessary node is deleted.
diff --git a/modules/cluster-autoscaler-deploying.adoc b/modules/cluster-autoscaler-deploying.adoc
@@ -0,0 +1,21 @@
+// Module included in the following assemblies:
+//
+// * control-plane-management/applying-autoscaling.adoc
+
+[id='cluster-autoscaler-deploying-{context}']
+= Deploying the ClusterAutoscaler
+
+To deploy the ClusterAutoscaler, you create an instance of the `ClusterAutoscaler`
+resource.
+
+.Procedure
+
+. Create a YAML file for the `ClusterAutoscaler` resource that is called
+`default.yaml`, and, after you customize it, save the resource definition.
+
+. Create the resource in the cluster:
++
+[source,bash]
+----
+$ oc create -f default.yaml
+----
diff --git a/modules/deploying-resource.adoc b/modules/deploying-resource.adoc
@@ -0,0 +1,31 @@
+// Be sure to set the :FeatureName: value in each assembly on the line before
+// the include statement for this module. For example, to set the FeatureName
+// value to "ClusterAutoscaler", add the following line to the assembly:
+// :FeatureName: ClusterAutoscaler
+// Module included in the following assemblies:
+//
+// * control-plane-management/applying-autoscaling.adoc
+
+
+
+[id='{FeatureName}-deploying-{context}']
+= Deploying the {FeatureName}
+
+To deploy the {FeatureName}, you create an instance of the `{FeatureName}`
+resource.
+
+.Procedure
+
+. Create a YAML file for the `{FeatureName}` resource that contains the
+customized resource definition.
+
+. Create the resource in the cluster:
++
+[source,bash]
+----
+$ oc create -f <filename>.yaml <1>
+----
+<1> `<filename>` is the name of the resource file that you customized.
+
+// Undefine {FeatureName} attribute, so that any mistakes are easily spotted
+:!FeatureName:
diff --git a/modules/machine-api-overview.adoc b/modules/machine-api-overview.adoc
@@ -39,7 +39,7 @@ available by the `ClusterAutoscalerOperator`.
 `MachineHealthChecker`:: This resource detects when a machine is unhealthy,
 deletes it, and, on supported platforms, makes a new machine.
 `ClusterAutoscaler`:: This resource is based on the upstream
-link:https://github.com/kubernetes/autoscaler/tree/master/cluster-autoscaler[Cluster Autoscaler]
+link:https://github.com/kubernetes/autoscaler/tree/master/cluster-autoscaler[ClusterAutoscaler]
 project. In the {product-title} implementation, it is integrated with the
 Cluster API by extending the `MachineSet` API.
 `ClusterAutoscalerOperator`:: Instead of interacting with the `ClusterAutoscaler`
diff --git a/modules/machine-autoscaler-about.adoc b/modules/machine-autoscaler-about.adoc
@@ -0,0 +1,12 @@
+// Module included in the following assemblies:
+//
+// * control-plane-management/applying-autoscaling.adoc
+
+[id='machine-autoscaler-about-{context}']
+= About the MachineAutoscaler
+
+The MachineAutoscaler adjusts the number of Machines in the MachineSets that you
+deploy in a {product-title} cluster. You can scale both the default `worker`
+MachineSet and any other MachineSets that you create. The MachineAutoscaler
+makes more Machines when the cluster runs out of resources to support more
+deployments.
diff --git a/modules/machine-autoscaler-crd.adoc b/modules/machine-autoscaler-crd.adoc
@@ -0,0 +1,36 @@
+// Module included in the following assemblies:
+//
+// * control-plane-management/applying-autoscaling.adoc
+
+[id='machine-autoscaler-crd-{context}']
+= MachineAutoscaler resource definition
+
+This MachineAutoscaler resource definition shows the parameters and sample
+values for the MachineAutoscaler.
+
+
+[source,yaml]
+----
+apiVersion: "autoscaling.openshift.io/v1beta1"
+kind: "MachineAutoscaler"
+metadata:
+  name: "worker" <1>
+  namespace: "openshift-cluster-api"
+spec:
+  minReplicas: 1 <2>
+  maxReplicas: 12 <3>
+  scaleTargetRef: <4>
+    apiVersion: machine.openshift.io/v1beta1
+    kind: MachineSet
+    name: worker
+----
+<1> Specify the `MachineAutoscaler` name. To make it easier to identify
+which MachineSet this MachineAutoscaler scales, specify or include the name of
+the MachineSet to scale.
+<2> Specify the minimum number Machines of the specified type to deploy in the
+specified AWS zone.
+<3> Specify the maxiumum number Machines of the specified type to deploy in the
+specified AWS zone.
+<4> In this section, provide values that describe the existing MachineSet to
+scale. The `kind` parameter value is always `MachineSet`, and the `name` value
+must match the name of an existing MachineSet.