contrib: add example for enabling per-container RDT monitoring #228

marquiz · 2025-09-04T12:12:36Z

No description provided.

marquiz · 2025-09-04T12:15:23Z

klihub · 2025-09-08T08:15:19Z

contrib/kustomize/samples/rdt-monitoring/initcontainers-patch.yaml

+            delete() {
+                # MON group is reaped as part of the CLOS (by the runtime) if it was under
+                # a dedicated CLOS created for this container
+                if [ "$closid" != "$id" ]; then


`$closid`? Shouldn't this be `[ "$clos" != "$id" ]`?

Yes, well spotted, thanks. Fixed.
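For readers following the fix, here is a minimal sketch of the cleanup logic with the corrected variable name. It is not the code from the PR: the helper name `delete_mon_group`, the `RESCTRL_ROOT` override, and the `<CLOS>/mon_groups/<id>` layout are assumptions made for illustration, based on how resctrl monitoring groups are normally laid out.

```shell
# Assumption: resctrl is mounted at the usual location. The variable is
# overridable so the function can be tried against a scratch directory.
RESCTRL_ROOT="${RESCTRL_ROOT:-/sys/fs/resctrl}"

# delete_mon_group CLOS ID   (hypothetical helper, not part of the PR)
delete_mon_group() {
    clos="$1"
    id="$2"

    # A CLOS named after the container is a dedicated one: the runtime
    # removes it, and the MON group under it goes away with it.
    if [ "$clos" = "$id" ]; then
        return 0
    fi

    # Shared CLOS: the MON group has to be removed explicitly.
    dir="$RESCTRL_ROOT/$clos/mon_groups/$id"
    if [ -d "$dir" ]; then
        rmdir "$dir"
    fi
}
```

With the original `$closid` typo the comparison would always see an empty string on the left-hand side, so the explicit removal would run even for a dedicated CLOS.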

Signed-off-by: Markus Lehtonen <markus.lehtonen@intel.com>

mikebrow

is it necessary to include dependency installation of the hook injector here?

contrib/kustomize/samples/rdt-monitoring/README.md

marquiz · 2025-09-09T08:11:28Z

> is it necessary to include dependency installation of the hook injector here?

This uses the hook-injector as a dependency; see the kustomization.yaml:

...
resources:
  - ../../hook-injector/unstable

In other words, this takes the hook-injector sample deployment as a base and applies customizations on top of it.

yonch · 2025-09-09T13:59:20Z

Hi team! Got a reference to this PR from @kad. This is a nice, simple approach! 👍

I'm working on an NRI-based container monitor that handles pre-existing containers: unvariance/collector#252

One of the problems is how to reliably pull all tasks of a cgroup into the resctrl group given tasks are live (they can fork as we're adding tasks, creating coverage gaps). If you have any feedback please share.

mikebrow

LGTM

klihub · 2025-09-10T07:41:48Z

> Hi team! Got a reference to this PR from @kad. This is a nice, simple approach! 👍
>
> I'm working on an NRI-based container monitor that handles pre-existing containers: unvariance/collector#252
>
> One of the problems is how to reliably pull all tasks of a cgroup into the resctrl group given tasks are live (they can fork as we're adding tasks, creating coverage gaps). If you have any feedback please share.

@yonch I think there is unfortunately no easy, reliable, yet unintrusive way of doing that. A potentially intrusive way is cgroup freezing: freeze the cgroup of the container (or the whole pod), wait for it to get frozen, assign all the tasks to the resctrl group (assuming this works while the cgroup/task is frozen; I haven't tried it), then unfreeze the cgroup.

marquiz · 2025-09-10T09:16:17Z

> @yonch I think there is unfortunately no easy, reliable, yet unintrusive way of doing that. A potentially intrusive way is cgroup freezing: freeze the cgroup of the container (or the whole pod), wait for it to get frozen, assign all the tasks to the resctrl group (assuming this works while the cgroup/task is frozen; I haven't tried it), then unfreeze the cgroup.

Yes, unfortunately I cannot think of any race-free approach other than the freezer, which obviously is a BIG hammer. If you ask me, the kernel should provide a simple way to do the migration, but 🤷‍♂️
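The freeze-based sequence discussed above could look roughly like the sketch below. It assumes cgroup v2 (`cgroup.freeze`, `cgroup.events`, `cgroup.threads`) and a resctrl group whose `tasks` file takes one task ID per write. The function name `migrate_frozen` and both path arguments are hypothetical, and, as noted in the thread, whether resctrl accepts the writes while the tasks are frozen has not been verified.

```shell
# migrate_frozen CGROUP_DIR RESCTRL_GROUP_DIR   (hypothetical helper)
#   CGROUP_DIR         cgroup v2 directory of the container or pod
#   RESCTRL_GROUP_DIR  resctrl group directory that contains a "tasks" file
migrate_frozen() {
    cg="$1"
    grp="$2"
    rc=0

    # Ask the kernel to freeze the cgroup (and all of its descendants).
    echo 1 > "$cg/cgroup.freeze" || return 1

    # Wait up to about 5 seconds until cgroup.events reports "frozen 1".
    tries=5
    until grep -q '^frozen 1$' "$cg/cgroup.events"; do
        tries=$((tries - 1))
        if [ "$tries" -le 0 ]; then
            rc=1
            break
        fi
        sleep 1
    done

    if [ "$rc" -eq 0 ]; then
        # Move every thread listed for this cgroup, one ID per write.
        while read -r tid; do
            [ -n "$tid" ] || continue
            # A task may have been killed in the meantime; ignore failures.
            echo "$tid" > "$grp/tasks" 2>/dev/null || true
        done < "$cg/cgroup.threads"
    fi

    # Always thaw the cgroup again, even if freezing timed out.
    echo 0 > "$cg/cgroup.freeze"
    return "$rc"
}
```

Note that `cgroup.threads` only lists threads that belong directly to the given cgroup, so a container with nested cgroups would need the same loop applied to each descendant directory while the parent stays frozen.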

marquiz force-pushed the devel/rdt-monitoring-kustomize branch from a074068 to 7864431 · September 4, 2025 12:14

klihub reviewed Sep 8, 2025

contrib: add example for enabling per-container RDT monitoring

91fbf06

Signed-off-by: Markus Lehtonen <markus.lehtonen@intel.com>

marquiz force-pushed the devel/rdt-monitoring-kustomize branch from 7864431 to 91fbf06 · September 8, 2025 08:47

klihub requested review from chrishenzie and mikebrow September 8, 2025 10:59

klihub approved these changes Sep 8, 2025

mikebrow reviewed Sep 8, 2025

klihub requested a review from mikebrow September 9, 2025 13:28

mikebrow approved these changes Sep 9, 2025

klihub merged commit 3c85968 into containerd:main Sep 10, 2025
16 checks passed

marquiz deleted the devel/rdt-monitoring-kustomize branch September 10, 2025 09:11
