You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/admin/runai-setup/cluster-setup/cluster-prerequisites.md
+12-5Lines changed: 12 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,11 @@
1
-
Below are the prerequisites of a cluster installed with Run:ai.
1
+
---
2
+
title: Prerequisites in a nutshell
3
+
summary: This article outlines the required prerequisites for a Run:ai installation.
4
+
authors:
5
+
- Jason Novich
6
+
- Yaron Goldberg
7
+
date: 2024-Apr-8
8
+
---
2
9
3
10
## Prerequisites in a Nutshell
4
11
@@ -63,7 +70,6 @@ For an up-to-date end-of-life statement of Kubernetes see [Kubernetes Release Hi
63
70
64
71
#### Pod Security Admission
65
72
66
-
67
73
Run:ai version 2.15 and above supports `restricted` policy for [Pod Security Admission](https://kubernetes.io/docs/concepts/security/pod-security-admission/){target=_blank} (PSA) on OpenShift only. Other Kubernetes distributions are only supported with `Privileged` policy.
68
74
69
75
For Run:ai on OpenShift to run with PSA `restricted` policy:
@@ -75,8 +81,9 @@ For Run:ai on OpenShift to run with PSA `restricted` policy:
75
81
pod-security.kubernetes.io/enforce=privileged
76
82
pod-security.kubernetes.io/warn=privileged
77
83
```
84
+
78
85
2. The workloads submitted through Run:ai should comply with the restrictions of PSA `restricted` policy, which are dropping all Linux capabilities and setting `runAsNonRoot` to `true`. This can be done and enforced using [Policies](../../workloads/policies/policies.md).
79
-
86
+
80
87
### NVIDIA
81
88
82
89
Run:ai has been certified on **NVIDIA GPU Operator** 22.9 to 23.9. Older versions (1.10 and 1.11) have a documented [NVIDIA issue](https://github.com/NVIDIA/gpu-feature-discovery/issues/26){target=_blank}.
@@ -123,7 +130,7 @@ Follow the [Getting Started guide](https://docs.nvidia.com/datacenter/cloud-nati
123
130
124
131
=== "RKE2"
125
132
* Follow the [Getting Started guide](https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/getting-started.html#rancher-kubernetes-engine-2){target=blank} to install the NVIDIA GPU Operator.
126
-
* Make sure to specify the `CONTAINERD_CONFIG` option exactly with the value specified in the document `/var/lib/rancher/rke2/agent/etc/containerd/config.toml.tmpl` even though the file may not exist in your system.
133
+
* Make sure to specify the `CONTAINERD_CONFIG` option exactly with the value specified in the document `/var/lib/rancher/rke2/agent/etc/containerd/config.toml.tmpl` even though the file may not exist in your system.
127
134
128
135
<!--
129
136
=== "RKE2"
@@ -302,7 +309,7 @@ However, for the URL to be accessible outside the cluster you must configure you
0 commit comments