You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jan 29, 2025. It is now read-only.
Note, you might need privileges to access default location of these files, use `sudo <same command` then in this case.
58
60
59
-
The "deploy"-folder has the necessary scripts for deploying. You can simply deploy by running:
61
+
The "deploy"-folder has the necessary scripts for deploying GAS. You can simply deploy by running:
60
62
61
-
``kubectl apply -f deploy/``
63
+
```bash
64
+
kubectl apply -f deploy/
65
+
```
62
66
63
67
After this is run GAS should be operable in the cluster and should be visible after running ``kubectl get pods``
64
68
65
69
Remember to run the configure-scheduler.sh script, or perform similar actions in your cluster if the script does not work in your environment directly.
66
70
71
+
#### Build GAS locally
72
+
73
+
GPU Aware Scheduling uses go modules. It requires Go 1.18 with modules enabled for building.
74
+
To build GAS locally on your host:
75
+
76
+
```bash
77
+
make build
78
+
```
79
+
80
+
You can also build inside docker, which creates the container:
81
+
82
+
```bash
83
+
make image
84
+
```
85
+
86
+
To deploy locally built GAS container image, just change the [deployment YAML](deploy/gas-deployment.yaml) and deploy normally as if it was pre-built image, see above.
87
+
67
88
### Configuration flags
68
89
The below flags can be passed to the binaries at run time.
|balancedResource| string | enable named resource balancing between GPUs | --balancedResource| ""
81
102
103
+
Some features are based on the labels put onto pods, for full features list see [usage doc](docs/usage.md)
104
+
82
105
#### Balanced resource (optional)
83
106
GAS can be configured to balance named resources so that the resource requests are distributed as evenly as possible between the GPUs. For example if the balanced resource is set to "tiles" and the containers request 1 tile each, the first container could get tile from "card0", the second from "card1", the third again from "card0" and so on.
84
107
85
108
## Adding the resource to make a deployment use GAS Scheduler Extender
86
109
87
110
For example, in a deployment file:
88
-
```
111
+
```yaml
89
112
apiVersion: apps/v1
90
113
kind: Deployment
91
114
metadata:
@@ -112,7 +135,7 @@ spec:
112
135
```
113
136
114
137
There is one change to the yaml here:
115
-
- A resources/limits entry requesting the resource gpu.intel.com/i915. This is used to restrict the use of GAS to only selected pods. If this is not in a pod spec the pod will not be scheduled by GAS.
138
+
- A resources/limits entry requesting the resource gpu.intel.com/i915 will make GAS take part in scheduling such deployment. If this resource is not requested, GAS will not be used during scheduling of the pod.
0 commit comments