Skip to content

Commit e06b3a9

Browse files
RUN-16507 add expanded introduction
1 parent 708c25f commit e06b3a9

File tree

1 file changed

+21
-0
lines changed

1 file changed

+21
-0
lines changed

docs/admin/admin-ui-setup/dashboard-analysis.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,27 @@
22

33
The Run:ai Administration User Interface provides a set of dashboards that help you monitor Clusters, Cluster Nodes, Projects, and Workloads. This document provides the key metrics to monitor, how to assess them as well as suggested actions.
44

5+
Dashboards are used by system administrators to analyze and diagnose issues that relate to:
6+
7+
* Physical Resources.
8+
* Organization resource allocation and utilization.
9+
* Usage characteristics.
10+
11+
System administrators need to know important information about the physical resources that are currently being used. Important information such as:
12+
13+
* Resource health.
14+
* Available resources and their distribution.
15+
* Is there a lack of resources.
16+
* Are resources being utilized correctly.
17+
18+
With this information, system administrators can hone in on:
19+
20+
* How resources are allocated across the organization.
21+
* How the different organizational units utilized quotas and resources within those quotas.
22+
* The actual performance of the organizational units.
23+
24+
These dashboards give system administrators the ability to drill down to see details of the different types of workloads that each of the organizational units is running. These usage and performance metrics ensure that system administrators can then take actions to correct issues that affect performance.
25+
526
There are 5 dashboards:
627

728
* [**GPU/CPU Overview**](#gpucpu-overview-dashboard) dashboard—Provides information about what is happening right now in the cluster.

0 commit comments

Comments
 (0)