What you would like to be added?
I have just gone through some of the sample code for TrainerClient v2, but I can't find any example that returns a useful metric, such as the loss or accuracy of the distributed training run. Currently, only the job ID is returned:
job_id = TrainerClient(namespace="kubeflow-system").train(...)
Is there a supported way to get these metrics back, or are there any better suggestions?
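To illustrate the kind of result I have in mind, here is a minimal sketch of a possible stopgap. It assumes the training function prints its metrics as `name=value` pairs and that the job logs are already available as lines of text. The helper `parse_final_metrics` and the sample log lines are hypothetical and not part of the SDK; how the logs are fetched is left out on purpose.

```python
import re
from typing import Dict, Iterable

# Matches "name=value" pairs such as "loss=0.1234" or "accuracy=9.1e-01".
METRIC_RE = re.compile(
    r"\b([A-Za-z_][\w.-]*)=([-+]?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)"
)


def parse_final_metrics(log_lines: Iterable[str]) -> Dict[str, float]:
    """Hypothetical helper: return the last value seen for each metric."""
    metrics: Dict[str, float] = {}
    for line in log_lines:
        for name, value in METRIC_RE.findall(line):
            # Later lines overwrite earlier ones, so the final epoch wins.
            metrics[name] = float(value)
    return metrics


# Made-up log lines standing in for the output of a training job.
sample_logs = [
    "epoch=1 loss=0.9021 accuracy=0.6512",
    "epoch=2 loss=0.4310 accuracy=0.8420",
]

final_metrics = parse_final_metrics(sample_logs)
print(final_metrics)
```

Scraping logs like this is fragile, which is why a first-class way to return metrics from the client would be preferable.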
Why is this needed?
To pass the metrics back to the Katib objective.
Love this feature?
Give it a 👍. We prioritize the features with the most 👍.