From 08f22a94a38bce503ef9b008a5e4740eedf79093 Mon Sep 17 00:00:00 2001
From: Sherin
Date: Sat, 17 May 2025 13:22:41 +0300
Subject: [PATCH 1/4] Updated hotfixes

---
 docs/home/changelog/hotfixes-2-18.md | 4 ++++
 docs/home/changelog/hotfixes-2-19.md | 7 +++++++
 docs/home/changelog/hotfixes-2-20.md | 6 ++++++
 3 files changed, 17 insertions(+)

diff --git a/docs/home/changelog/hotfixes-2-18.md b/docs/home/changelog/hotfixes-2-18.md
index 8ff00869ef..436de3f89f 100644
--- a/docs/home/changelog/hotfixes-2-18.md
+++ b/docs/home/changelog/hotfixes-2-18.md
@@ -12,6 +12,10 @@ The following is a list of the known and fixed issues for Run:ai V2.18.
 
 | Internal ID | Hotfix # | Description |
 | :---- | :---- | :---- |
+| RUN-27640 | 2.18.104 | Fixed a security vulnerability in github.com.golang-jwt.jwt.v5 related to CVE-2025-30204 with severity HIGH. |
+| RUN-27893 | 2.18.104 | Fixed an issue where workloads submitted with an invalid node port range would get stuck in "Creating" status. |
+| RUN-28006 | 2.18.104 | Fixed an issue where tokens became invalid for the API server after one hour. |
+| RUN-28097 | 2.18.104 | Fixed an issue where the allocated_gpu_count_per_gpu metric displayed incorrect data for fractional pods. |
 | RUN-26359 | 2.18.102 | Fixed an issue in CLI v2 where using the `--toleration` option required incorrect mandatory fields. |
 | RUN-26878 | 2.18.100 | Fixed an issue where, in some cases, previous inference request errors caused subsequent successful requests to report latency as NaN (Not a Number) instead of the actual value. As a result, these requests were not displayed in the UI latency graph, and the API returned NaN. |
 | RUN-26800 | 2.18.99 | Fixed an issue where the `runai list nodes` command in CLI v1 did not display the correct number of free GPUs when using GPU memory. |
diff --git a/docs/home/changelog/hotfixes-2-19.md b/docs/home/changelog/hotfixes-2-19.md
index f478778c12..34544e805f 100644
--- a/docs/home/changelog/hotfixes-2-19.md
+++ b/docs/home/changelog/hotfixes-2-19.md
@@ -10,6 +10,13 @@ The following is a list of the known and fixed issues for Run:ai V2.19.
 
 | Internal ID | Hotfix # | Description |
 | :---- | :---- | :---- |
+| RUN-27295 | 2.19.113 | Fixed an issue in CLI v2 where the --node-type flag for inference workloads was not properly propagated to the pod specification. |
+| RUN-27514 | 2.19.113 | Fixed an issue with incorrect calculation of the ALLOCATED_CPU_MEMORY_BYTES telemetry metric. |
+| RUN-27521 | 2.19.113 | Fixed an issue where disabling the CPU quota feature in Admin UI did not remove existing CPU quotas from projects and departments. |
+| RUN-27841 | 2.19.113 | Fixed an issue where workloads without a memory request failed validation in the workload-controller webhook. |
+| RUN-28212 | 2.19.113 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
+| RUN-27638 | 2.19.112 | Fixed a security vulnerability in `axios` related to CVE-2025-27152 with severity HIGH. |
+| RUN-27867 | 2.19.112 | Fixed a security vulnerability in `github.com.golang.org.x.crypto` related to CVE-2025-22869 with severity HIGH. |
 | RUN-27944 | 2.19.111 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v4` related to CVE-2025-30204 with severity HIGH. |
 | RUN-26994 | 2.19.111 | Fixed an issue where session timeouts occurred when using the CLI. |
 | RUN-28097 | 2.19.111 | Fixed an issue where the `allocated_gpu_count_per_gpu` metric displayed incorrect data for fractional pods. |
diff --git a/docs/home/changelog/hotfixes-2-20.md b/docs/home/changelog/hotfixes-2-20.md
index b27be5afd0..116de10a2f 100644
--- a/docs/home/changelog/hotfixes-2-20.md
+++ b/docs/home/changelog/hotfixes-2-20.md
@@ -11,6 +11,12 @@ This section provides details on all hotfixes available for version 2.20. Hotfix
 
 | Version | Date | Internal ID | Description |
 |--|--|--|--|
+| 2.20.43 | 15/05/2025 | RUN-27295 | Fixed an issue in CLI v2 where the `--node-type` flag for inference workloads was not properly propagated to the pod specification. |
+| 2.20.43 | 15/05/2025 | RUN-27375| Fixed an issue where projects were not visible in the legacy job submission form, preventing users from selecting a target project. |
+| 2.20.43 | 15/05/2025 | RUN-27841 | Fixed an issue where workloads without a memory request failed validation in the workload-controller webhook. |
+| 2.20.42 | 13/05/2025 | RUN-27514 | Fixed an issue with incorrect calculation of the ALLOCATED_CPU_MEMORY_BYTES telemetry metric. |
+| 2.20.42 | 13/05/2025 | RUN-27521 | Fixed an issue where disabling the CPU quota feature in Admin UI did not remove existing CPU quotas from projects and departments. |
+| 2.20.42 | 13/05/2025 | RUN-28380 | Fixed a security vulnerability in `github.com.golang.org.x.crypto` related to CVE-2025-22869 with severity HIGH. |
 | 2.20.41 | 06/05/2025 | RUN-28241 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
 | 2.20.41 | 06/05/2025 | RUN-28097 | Fixed an issue where the `allocated_gpu_count_per_gpu metric` displayed incorrect data for fractional pods. |
 | 2.20.41 | 06/05/2025 | RUN-28006 | Fixed an issue where tokens became invalid for the API server after one hour. |

From 6c873baae63b4cc3f6a874330aa47731d16d20ef Mon Sep 17 00:00:00 2001
From: Sherin
Date: Mon, 19 May 2025 13:19:42 +0300
Subject: [PATCH 2/4] Update hotfixes-2-18.md

---
 docs/home/changelog/hotfixes-2-18.md | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/docs/home/changelog/hotfixes-2-18.md b/docs/home/changelog/hotfixes-2-18.md
index 436de3f89f..962a6cc91f 100644
--- a/docs/home/changelog/hotfixes-2-18.md
+++ b/docs/home/changelog/hotfixes-2-18.md
@@ -12,10 +12,11 @@ The following is a list of the known and fixed issues for Run:ai V2.18.
 
 | Internal ID | Hotfix # | Description |
 | :---- | :---- | :---- |
-| RUN-27640 | 2.18.104 | Fixed a security vulnerability in github.com.golang-jwt.jwt.v5 related to CVE-2025-30204 with severity HIGH. |
+| RUN-27640 | 2.18.104 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
 | RUN-27893 | 2.18.104 | Fixed an issue where workloads submitted with an invalid node port range would get stuck in "Creating" status. |
 | RUN-28006 | 2.18.104 | Fixed an issue where tokens became invalid for the API server after one hour. |
-| RUN-28097 | 2.18.104 | Fixed an issue where the allocated_gpu_count_per_gpu metric displayed incorrect data for fractional pods. |
+| RUN-27543| 2.18.104 | Fixed an issue where the GPU quota for a department could not be edited because the maxAllowed parameter was not updated when the over quota weight feature flag was turned off. |
+| RUN-28097 | 2.18.104 | Fixed an issue where the `allocated_gpu_count_per_gpu` metric displayed incorrect data for fractional pods. |
 | RUN-26359 | 2.18.102 | Fixed an issue in CLI v2 where using the `--toleration` option required incorrect mandatory fields. |
 | RUN-26878 | 2.18.100 | Fixed an issue where, in some cases, previous inference request errors caused subsequent successful requests to report latency as NaN (Not a Number) instead of the actual value. As a result, these requests were not displayed in the UI latency graph, and the API returned NaN. |
 | RUN-26800 | 2.18.99 | Fixed an issue where the `runai list nodes` command in CLI v1 did not display the correct number of free GPUs when using GPU memory. |

From ac5286365c923db43e3e11b446d4c2834ccd7b7d Mon Sep 17 00:00:00 2001
From: Sherin
Date: Mon, 19 May 2025 13:28:22 +0300
Subject: [PATCH 3/4] Updated hotfixes

---
 docs/home/changelog/hotfixes-2-18.md | 6 +++---
 docs/home/changelog/hotfixes-2-19.md | 7 +++----
 docs/home/changelog/hotfixes-2-20.md | 4 ++--
 3 files changed, 8 insertions(+), 9 deletions(-)

diff --git a/docs/home/changelog/hotfixes-2-18.md b/docs/home/changelog/hotfixes-2-18.md
index 962a6cc91f..cc60f1fee0 100644
--- a/docs/home/changelog/hotfixes-2-18.md
+++ b/docs/home/changelog/hotfixes-2-18.md
@@ -13,10 +13,10 @@ The following is a list of the known and fixed issues for Run:ai V2.18.
 | Internal ID | Hotfix # | Description |
 | :---- | :---- | :---- |
 | RUN-27640 | 2.18.104 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
-| RUN-27893 | 2.18.104 | Fixed an issue where workloads submitted with an invalid node port range would get stuck in "Creating" status. |
+| RUN-27893 | 2.18.104 | Fixed an issue where workloads submitted with an invalid node port range would get stuck in Creating status. |
 | RUN-28006 | 2.18.104 | Fixed an issue where tokens became invalid for the API server after one hour. |
-| RUN-27543| 2.18.104 | Fixed an issue where the GPU quota for a department could not be edited because the maxAllowed parameter was not updated when the over quota weight feature flag was turned off. |
-| RUN-28097 | 2.18.104 | Fixed an issue where the `allocated_gpu_count_per_gpu` metric displayed incorrect data for fractional pods. |
+| RUN-27543| 2.18.104 | Fixed an issue where the GPU quota for a department could not be edited because the `maxAllowed` parameter was not updated when the over quota weight feature flag was turned off. |
+| RUN-28097 | 2.18.104 | Fixed an issue where the `ALLOCATED_GPU_COUNT_PER_GPU` metric displayed incorrect data for fractional pods. |
 | RUN-26359 | 2.18.102 | Fixed an issue in CLI v2 where using the `--toleration` option required incorrect mandatory fields. |
 | RUN-26878 | 2.18.100 | Fixed an issue where, in some cases, previous inference request errors caused subsequent successful requests to report latency as NaN (Not a Number) instead of the actual value. As a result, these requests were not displayed in the UI latency graph, and the API returned NaN. |
 | RUN-26800 | 2.18.99 | Fixed an issue where the `runai list nodes` command in CLI v1 did not display the correct number of free GPUs when using GPU memory. |
diff --git a/docs/home/changelog/hotfixes-2-19.md b/docs/home/changelog/hotfixes-2-19.md
index 34544e805f..7e7e648274 100644
--- a/docs/home/changelog/hotfixes-2-19.md
+++ b/docs/home/changelog/hotfixes-2-19.md
@@ -10,16 +10,15 @@ The following is a list of the known and fixed issues for Run:ai V2.19.
 
 | Internal ID | Hotfix # | Description |
 | :---- | :---- | :---- |
-| RUN-27295 | 2.19.113 | Fixed an issue in CLI v2 where the --node-type flag for inference workloads was not properly propagated to the pod specification. |
-| RUN-27514 | 2.19.113 | Fixed an issue with incorrect calculation of the ALLOCATED_CPU_MEMORY_BYTES telemetry metric. |
+| RUN-27295 | 2.19.113 | Fixed an issue in CLI v2 where the `--node-type` flag for inference workloads was not properly propagated to the pod specification. |
+| RUN-27514 | 2.19.113 | Fixed an issue with incorrect calculation of the `ALLOCATED_CPU_MEMORY_BYTES` telemetry metric. |
 | RUN-27521 | 2.19.113 | Fixed an issue where disabling the CPU quota feature in Admin UI did not remove existing CPU quotas from projects and departments. |
 | RUN-27841 | 2.19.113 | Fixed an issue where workloads without a memory request failed validation in the workload-controller webhook. |
-| RUN-28212 | 2.19.113 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
 | RUN-27638 | 2.19.112 | Fixed a security vulnerability in `axios` related to CVE-2025-27152 with severity HIGH. |
 | RUN-27867 | 2.19.112 | Fixed a security vulnerability in `github.com.golang.org.x.crypto` related to CVE-2025-22869 with severity HIGH. |
 | RUN-27944 | 2.19.111 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v4` related to CVE-2025-30204 with severity HIGH. |
 | RUN-26994 | 2.19.111 | Fixed an issue where session timeouts occurred when using the CLI. |
-| RUN-28097 | 2.19.111 | Fixed an issue where the `allocated_gpu_count_per_gpu` metric displayed incorrect data for fractional pods. |
+| RUN-28097 | 2.19.111 | Fixed an issue where the`ALLOCATED_GPU_COUNT_PER_GPU` metric displayed incorrect data for fractional pods. |
 | RUN-27837 | 2.19.110 | Fixed an issue where a node pool’s placement strategy stopped functioning correctly after being edited. |
 | RUN-27628 | 2.19.109 | Fixed an issue where a node pool could remain stuck in Updating status in certain cases. |
 | RUN-27640 | 2.19.109 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
diff --git a/docs/home/changelog/hotfixes-2-20.md b/docs/home/changelog/hotfixes-2-20.md
index 116de10a2f..018835bec6 100644
--- a/docs/home/changelog/hotfixes-2-20.md
+++ b/docs/home/changelog/hotfixes-2-20.md
@@ -14,11 +14,11 @@ This section provides details on all hotfixes available for version 2.20. Hotfix
 | 2.20.43 | 15/05/2025 | RUN-27295 | Fixed an issue in CLI v2 where the `--node-type` flag for inference workloads was not properly propagated to the pod specification. |
 | 2.20.43 | 15/05/2025 | RUN-27375| Fixed an issue where projects were not visible in the legacy job submission form, preventing users from selecting a target project. |
 | 2.20.43 | 15/05/2025 | RUN-27841 | Fixed an issue where workloads without a memory request failed validation in the workload-controller webhook. |
-| 2.20.42 | 13/05/2025 | RUN-27514 | Fixed an issue with incorrect calculation of the ALLOCATED_CPU_MEMORY_BYTES telemetry metric. |
+| 2.20.42 | 13/05/2025 | RUN-27514 | Fixed an issue with incorrect calculation of the `ALLOCATED_CPU_MEMORY_BYTES` telemetry metric. |
 | 2.20.42 | 13/05/2025 | RUN-27521 | Fixed an issue where disabling the CPU quota feature in Admin UI did not remove existing CPU quotas from projects and departments. |
 | 2.20.42 | 13/05/2025 | RUN-28380 | Fixed a security vulnerability in `github.com.golang.org.x.crypto` related to CVE-2025-22869 with severity HIGH. |
 | 2.20.41 | 06/05/2025 | RUN-28241 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
-| 2.20.41 | 06/05/2025 | RUN-28097 | Fixed an issue where the `allocated_gpu_count_per_gpu metric` displayed incorrect data for fractional pods. |
+| 2.20.41 | 06/05/2025 | RUN-28097 | Fixed an issue where the `ALLOCATED_GPU_COUNT_PER_GPU` metric displayed incorrect data for fractional pods. |
 | 2.20.41 | 06/05/2025 | RUN-28006 | Fixed an issue where tokens became invalid for the API server after one hour. |
 | 2.20.41 | 06/05/2025 | RUN-27638 | Fixed a security vulnerability in axios related to CVE-2025-27152 with severity HIGH. |
 | 2.20.40 | 30/04/2025 | RUN-27837 | Fixed an issue where a node pool’s placement strategy stopped functioning correctly after being edited.|

From db1528a4fa71e3ea4d109426b76943b8a390cb3c Mon Sep 17 00:00:00 2001
From: Sherin
Date: Mon, 19 May 2025 13:30:29 +0300
Subject: [PATCH 4/4] Updated hotfixes

---
 docs/home/changelog/hotfixes-2-19.md | 2 +-
 docs/home/changelog/hotfixes-2-20.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/docs/home/changelog/hotfixes-2-19.md b/docs/home/changelog/hotfixes-2-19.md
index 7e7e648274..591c120319 100644
--- a/docs/home/changelog/hotfixes-2-19.md
+++ b/docs/home/changelog/hotfixes-2-19.md
@@ -12,7 +12,7 @@ The following is a list of the known and fixed issues for Run:ai V2.19.
 | :---- | :---- | :---- |
 | RUN-27295 | 2.19.113 | Fixed an issue in CLI v2 where the `--node-type` flag for inference workloads was not properly propagated to the pod specification. |
 | RUN-27514 | 2.19.113 | Fixed an issue with incorrect calculation of the `ALLOCATED_CPU_MEMORY_BYTES` telemetry metric. |
-| RUN-27521 | 2.19.113 | Fixed an issue where disabling the CPU quota feature in Admin UI did not remove existing CPU quotas from projects and departments. |
+| RUN-27521 | 2.19.113 | Fixed an issue where disabling CPU quota in the General settings did not remove existing CPU quotas from projects and departments. |
 | RUN-27841 | 2.19.113 | Fixed an issue where workloads without a memory request failed validation in the workload-controller webhook. |
 | RUN-27638 | 2.19.112 | Fixed a security vulnerability in `axios` related to CVE-2025-27152 with severity HIGH. |
 | RUN-27867 | 2.19.112 | Fixed a security vulnerability in `github.com.golang.org.x.crypto` related to CVE-2025-22869 with severity HIGH. |
diff --git a/docs/home/changelog/hotfixes-2-20.md b/docs/home/changelog/hotfixes-2-20.md
index 018835bec6..3cae9d278c 100644
--- a/docs/home/changelog/hotfixes-2-20.md
+++ b/docs/home/changelog/hotfixes-2-20.md
@@ -15,7 +15,7 @@ This section provides details on all hotfixes available for version 2.20. Hotfix
 | 2.20.43 | 15/05/2025 | RUN-27375| Fixed an issue where projects were not visible in the legacy job submission form, preventing users from selecting a target project. |
 | 2.20.43 | 15/05/2025 | RUN-27841 | Fixed an issue where workloads without a memory request failed validation in the workload-controller webhook. |
 | 2.20.42 | 13/05/2025 | RUN-27514 | Fixed an issue with incorrect calculation of the `ALLOCATED_CPU_MEMORY_BYTES` telemetry metric. |
-| 2.20.42 | 13/05/2025 | RUN-27521 | Fixed an issue where disabling the CPU quota feature in Admin UI did not remove existing CPU quotas from projects and departments. |
+| 2.20.42 | 13/05/2025 | RUN-27521 | Fixed an issue where disabling CPU quota in the General settings did not remove existing CPU quotas from projects and departments. |
 | 2.20.42 | 13/05/2025 | RUN-28380 | Fixed a security vulnerability in `github.com.golang.org.x.crypto` related to CVE-2025-22869 with severity HIGH. |
 | 2.20.41 | 06/05/2025 | RUN-28241 | Fixed a security vulnerability in `github.com.golang-jwt.jwt.v5` related to CVE-2025-30204 with severity HIGH. |
 | 2.20.41 | 06/05/2025 | RUN-28097 | Fixed an issue where the `ALLOCATED_GPU_COUNT_PER_GPU` metric displayed incorrect data for fractional pods. |