docs/developer/metrics/metrics.md
24 additions & 24 deletions
@@ -114,31 +114,31 @@ For additional information, see Kubernetes [kube-state-metrics](https://github.c
Starting in cluster version 2.17, some metric names have changed. In addition, some Run:ai metrics are available as API endpoints. Using the API endpoints is more efficient and provides an easier way to retrieve metrics from any application. The following table lists the metrics that changed.
-| Metric name in version 2.16 | 2.17 Change Description | 2.17 API Endpoint|
-| runai\_active\_job\_cpu\_requested\_cores | available also via API |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_REQUEST\_CORES" metricType|
-| runai\_active\_job\_memory\_requested\_bytes | available also via API |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_MEMORY\_REQUEST\_BYTES" metricType|
-| runai\_cluster\_cpu\_utilization | available also via API |https://app.run.ai/api/v2/clusters/{clusterUuid}/metrics ; with "CPU\_UTILIZATION" metricType|
-| runai\_cluster\_memory\_utilization | available also via API |https://app.run.ai/api/v2/clusters/{clusterUuid}/metrics ; with "CPU\_MEMORY\_UTILIZATION" metricType|
-| runai\_gpu\_utilization\_non\_fractional\_jobs | no longer available ||
-| runai\_gpu\_utilization\_per\_pod\_per\_gpu | available also via API |https://app.run.ai/api/v1/workloads/{workloadId}/pods/{podId}/metrics ; with "GPU\_UTILIZATION\_PER\_GPU" metricType|
-| runai\_gpu\_utilization\_per\_workload | available also via API + labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "GPU\_UTILIZATION" metricType|
-| runai\_job\_image | no longer available ||
-| runai\_job\_requested\_gpu\_memory | available also via API + renamed to: "runai\_requested\_gpu\_memory\_mb\_per\_workload" with different labels |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "GPU\_MEMORY\_REQUEST\_BYTES" metricType|
-| runai\_job\_requested\_gpus | renamed to: "runai\_requested\_gpus\_per\_workload" with different labels ||
-| runai\_job\_total\_runtime | renamed to: "runai\_run\_time\_seconds\_per\_workload" with different labels ||
-| runai\_job\_total\_wait\_time | renamed to: "runai\_wait\_time\_seconds\_per\_workload" with different labels ||
-| runai\_gpu\_memory\_used\_mebibytes\_per\_workload | available also via API + labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "GPU\_MEMORY\_USAGE\_BYTES" metricType|
-| runai\_gpu\_memory\_used\_mebibytes\_per\_pod\_per\_gpu | available also via API + labels changed|https://app.run.ai/api/v1/workloads/{workloadId}/pods/{podId}/metrics ; with "GPU\_MEMORY\_USAGE\_BYTES\_PER\_GPU" metricType |
-| runai\_node\_gpu\_used\_memory\_bytes | renamed and changed units: "runai\_gpu\_memory\_used\_mebibytes\_per\_node" ||
-| runai\_node\_total\_memory\_bytes | renamed and changed units: "runai\_gpu\_memory\_total\_mebibytes\_per\_node" ||
-| runai\_project\_info | labels changed ||
-| runai\_active\_job\_cpu\_limits | available also via + renamed to: "runai\_cpu\_limits\_per\_active\_workload" |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_LIMIT\_CORES" metricType |
-| runai\_job\_cpu\_usage | available also via + labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_USAGE\_CORES" metricType |
-| runai\_active\_job\_memory\_limits | available also via + renamed to: "runai\_memory\_limits\_per\_active\_workload" |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_MEMORY\_LIMIT\_BYTES" metricType |
+| Metric name in version 2.16 | 2.17 Change Description | 2.17 API Endpoint |
+|--|--|--|
+| runai\_active\_job\_cpu\_requested\_cores | available also via API |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_REQUEST\_CORES" metricType |
+| runai\_active\_job\_memory\_requested\_bytes | available also via API |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_MEMORY\_REQUEST\_BYTES" metricType |
+| runai\_cluster\_cpu\_utilization | available also via API |https://app.run.ai/api/v2/clusters/{clusterUuid}/metrics ; with "CPU\_UTILIZATION" metricType |
+| runai\_cluster\_memory\_utilization | available also via API |https://app.run.ai/api/v2/clusters/{clusterUuid}/metrics ; with "CPU\_MEMORY\_UTILIZATION" metricType |
+| runai\_gpu\_utilization\_non\_fractional\_jobs | no longer available ||
+| runai\_gpu\_utilization\_per\_pod\_per\_gpu | available also via API |https://app.run.ai/api/v1/workloads/{workloadId}/pods/{podId}/metrics ; with "GPU\_UTILIZATION\_PER\_GPU" metricType |
+| runai\_gpu\_utilization\_per\_workload | available also via API and labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "GPU\_UTILIZATION" metricType |
+| runai\_job\_image | no longer available ||
+| runai\_job\_requested\_gpu\_memory | available also via API and renamed to: "runai\_requested\_gpu\_memory\_mb\_per\_workload" with different labels |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "GPU\_MEMORY\_REQUEST\_BYTES" metricType |
+| runai\_job\_requested\_gpus | renamed to: "runai\_requested\_gpus\_per\_workload" with different labels ||
+| runai\_job\_total\_runtime | renamed to: "runai\_run\_time\_seconds\_per\_workload" with different labels ||
+| runai\_job\_total\_wait\_time | renamed to: "runai\_wait\_time\_seconds\_per\_workload" with different labels ||
+| runai\_gpu\_memory\_used\_mebibytes\_per\_workload | available also via API and labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "GPU\_MEMORY\_USAGE\_BYTES" metricType |
+| runai\_gpu\_memory\_used\_mebibytes\_per\_pod\_per\_gpu | available also via API and labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/pods/{podId}/metrics ; with "GPU\_MEMORY\_USAGE\_BYTES\_PER\_GPU" metricType |
+| runai\_node\_gpu\_used\_memory\_bytes | renamed and changed units: "runai\_gpu\_memory\_used\_mebibytes\_per\_node" ||
+| runai\_node\_total\_memory\_bytes | renamed and changed units: "runai\_gpu\_memory\_total\_mebibytes\_per\_node" ||
+| runai\_project\_info | labels changed ||||
+| runai\_active\_job\_cpu\_limits | available also via API and renamed to: "runai\_cpu\_limits\_per\_active\_workload" |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_LIMIT\_CORES" metricType |
+| runai\_job\_cpu\_usage | available also via API and labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_USAGE\_CORES" metricType |
+| runai\_active\_job\_memory\_limits | available also via API and renamed to: "runai\_memory\_limits\_per\_active\_workload" |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_MEMORY\_LIMIT\_BYTES" metricType |
 | runai\_running\_job\_memory\_requested\_bytes | was a duplication of "runai\_active\_job\_memory\_requested\_bytes", see above ||
-| runai\_job\_memory\_used\_bytes | available also via + labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_MEMORY\_USAGE\_BYTES" metricType |
+| runai\_job\_memory\_used\_bytes | available also via API and labels changed |https://app.run.ai/api/v1/workloads/{workloadId}/metrics ; with "CPU\_MEMORY\_USAGE\_BYTES" metricType |
 | runai\_job\_swap\_memory\_used\_bytes | no longer available ||
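
The mapping in the table can be read as "old metric name, then endpoint path plus `metricType`". The sketch below illustrates that reading for a few rows. It is not taken from the Run:ai documentation: the helper name `metric_url` and the ID values are hypothetical, and passing `metricType` as a URL query parameter is an assumption (the table only says each endpoint is used "with" a given metricType). Authentication and any time-range parameters are left out.

```python
from urllib.parse import urlencode

BASE_URL = "https://app.run.ai"

# A few rows copied from the table:
# 2.16 metric name -> (endpoint path template, metricType)
METRIC_ENDPOINTS = {
    "runai_active_job_cpu_requested_cores": (
        "/api/v1/workloads/{workloadId}/metrics", "CPU_REQUEST_CORES"),
    "runai_cluster_cpu_utilization": (
        "/api/v2/clusters/{clusterUuid}/metrics", "CPU_UTILIZATION"),
    "runai_gpu_utilization_per_pod_per_gpu": (
        "/api/v1/workloads/{workloadId}/pods/{podId}/metrics",
        "GPU_UTILIZATION_PER_GPU"),
    "runai_gpu_utilization_per_workload": (
        "/api/v1/workloads/{workloadId}/metrics", "GPU_UTILIZATION"),
}


def metric_url(old_name: str, **ids: str) -> str:
    """Build the 2.17 API URL for a metric known by its 2.16 name.

    Raises KeyError if the metric has no API endpoint in the mapping
    or if a required path ID (for example workloadId) is missing.
    """
    path_template, metric_type = METRIC_ENDPOINTS[old_name]
    path = path_template.format(**ids)
    # Assumption: metricType is sent as a query parameter.
    query = urlencode({"metricType": metric_type})
    return f"{BASE_URL}{path}?{query}"


if __name__ == "__main__":
    # "my-workload-id" is a placeholder, not a real workload ID.
    print(metric_url("runai_gpu_utilization_per_workload",
                     workloadId="my-workload-id"))
```

Metrics marked "no longer available" or only renamed have no entry in the mapping, so looking them up raises `KeyError` rather than producing a URL that does not exist.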