Please add a user-defined return value for a TrainerClient().train run #2749

@ram4444

What you would like to be added?

I have just gone through some of the sample code for TrainerClient v2. However, I can't find any example that returns a useful metric, such as the loss or accuracy of the distributed training run. Currently only the job ID is returned:

job_id = TrainerClient(namespace="kubeflow-system").train

Or are there any better suggestions?

Why is this needed?

To pass the metrics back to the Katib objective.
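
To make the request concrete, here is a minimal sketch of the kind of round trip I have in mind. It assumes the training function's stdout can be read back as job logs (how the logs are fetched is not shown) and that metrics are printed as `name=value` lines. `report_metrics` and `parse_metrics` are hypothetical helpers for illustration only, not part of the Kubeflow SDK.

```python
import re
from typing import Dict, Iterable

# Matches whole lines such as "loss=0.1234" or "accuracy = 0.97".
# The name=value format is an assumption made for this sketch.
_METRIC_LINE = re.compile(
    r"^\s*([\w-]+)\s*=\s*([+-]?(?:\d+\.?\d*|\.\d+)(?:[eE][+-]?\d+)?)\s*$"
)


def report_metrics(**metrics: float) -> None:
    """Hypothetical helper: print each metric as a name=value line.

    Intended to be called from inside the training function.
    """
    for name, value in metrics.items():
        print(f"{name}={value}")


def parse_metrics(log_lines: Iterable[str]) -> Dict[str, float]:
    """Hypothetical helper: keep the last reported value of each metric."""
    latest: Dict[str, float] = {}
    for line in log_lines:
        match = _METRIC_LINE.match(line)
        if match:
            latest[match.group(1)] = float(match.group(2))
    return latest
```

With something like this, the caller could collect the log lines of a finished job and pass the resulting dictionary on as the objective value. A return value from the train run itself would make this parsing step unnecessary.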

Love this feature?

Give it a 👍. We prioritize the features with the most 👍.
