Skip to content

Conversation

kapil27
Copy link
Contributor

@kapil27 kapil27 commented Oct 17, 2025

What this PR does / why we need it:

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged):
Fixes #2894

Checklist:

  • Docs included if any changes are user facing

…port

Signed-off-by: kapil27 <knema@redhat.com>
@kapil27 kapil27 changed the title feat(manager): add controller manager configuration and configmap sup… feat: add controller manager configuration helm chart Oct 17, 2025
@coveralls
Copy link

coveralls commented Oct 17, 2025

Pull Request Test Coverage Report for Build 18674107727

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • 112 unchanged lines in 5 files lost coverage.
  • Overall coverage decreased (-0.5%) to 51.675%

Files with Coverage Reduction New Missed Lines %
pkg/runtime/framework/plugins/torch/torch.go 5 95.2%
pkg/webhooks/trainingruntime_webhook.go 13 60.0%
pkg/runtime/framework/plugins/mpi/mpi.go 18 85.27%
pkg/runtime/framework/plugins/jobset/builder.go 24 0.0%
pkg/runtime/runtime.go 52 54.74%
Totals Coverage Status
Change from base Build 18562606347: -0.5%
Covered Lines: 1265
Relevant Lines: 2448

💛 - Coveralls

@andreyvelich
Copy link
Member

Thanks for opening this @kapil27!
We are planning to cut v2.1.0-rc.0 today, is this PR ready to review ?

{{/*
Create the name of the manager service account.
*/}}
{{- define "trainer.manager.serviceAccount.name" -}}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do we need this ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're right , we don't!
Removed the duplicate trainer.manager.serviceAccount.name helper from manager/_helpers.tpl. The helper already exists in rbac/_helpers.tpl

# Leader election configuration
leaderElection:
leaderElect: {{ gt (.Values.manager.replicas | int) 1 }}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes! Fixed the leader election configuration to default to true

Copy link
Member

@andreyvelich andreyvelich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kapil27 Can you also remove the post-fix generation from the ConfigMap in kustomize manifests:

- name: kubeflow-trainer-config
files:
- controller_manager_config.yaml

of: ConfigMap
- equal:
path: metadata.name
value: kubeflow-trainer-controller-manager-config
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The name of the config map is:

kubeflow-trainer-config

@andreyvelich
Copy link
Member

@kapil27 Did you get a chance to address the comments ?

Signed-off-by: kapil27 <knema@redhat.com>
Copy link
Member

@andreyvelich andreyvelich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @kapil27!
/lgtm
/approve

@google-oss-prow
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andreyvelich

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@andreyvelich
Copy link
Member

@kapil27 Are there any other changes you would like to make to this PR before we move forward ?

@kapil27 kapil27 marked this pull request as ready for review October 21, 2025 12:08
@google-oss-prow google-oss-prow bot merged commit a2797af into kubeflow:master Oct 21, 2025
32 of 33 checks passed
@google-oss-prow google-oss-prow bot added this to the v2.1 milestone Oct 21, 2025
@kapil27
Copy link
Contributor Author

kapil27 commented Oct 21, 2025

@andreyvelich I’ve addressed all the comments and completed initial testing. It looks good on my end. ready to go ahead.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Trainer Config is missing in Helm Charts

3 participants