@helenxie-bit
Contributor

What this PR does / why we need it:
Add an e2e test for the train API in test_e2e_train_api.py.

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged):
Fixes #

Checklist:

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>
@coveralls

coveralls commented Aug 9, 2024

Pull Request Test Coverage Report for Build 12449362407

Details

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 100.0%

Totals Coverage Status
Change from base Build 12381609583: 0.0%
Covered Lines: 85
Relevant Lines: 85

💛 - Coveralls

@helenxie-bit helenxie-bit changed the title Add e2e test for train API [WIP]Add e2e test for train API Aug 9, 2024
@helenxie-bit helenxie-bit changed the title [WIP]Add e2e test for train API [WIP] Add e2e test for train API Aug 9, 2024
@helenxie-bit
Contributor Author

Run the e2e test for the train API as a separate GitHub Action.

Can you try to separate them and see if the issue is resolved? cc @kubeflow/wg-training-leads

@andreyvelich I've separated the e2e test for train API and now it works. Please review when you have time.

Member

@andreyvelich andreyvelich left a comment

Thank you for this. Overall LGTM, just a small comment.
/assign @deepanker13 @kubeflow/wg-training-leads @Electronic-Waste

@deepanker13
Contributor

/lgtm
Thanks @helenxie-bit

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>
@google-oss-prow google-oss-prow bot removed the lgtm label Dec 20, 2024
@helenxie-bit
Contributor Author

helenxie-bit commented Dec 20, 2024

I've updated the Kubernetes version to v1.31.4. Please review when you have time @andreyvelich @Electronic-Waste @kubeflow/wg-training-leads

Member

@Electronic-Waste Electronic-Waste left a comment

Basically LGTM. I left some comments for you @helenxie-bit

strategy:
  fail-fast: false
  matrix:
    kubernetes-version: ["v1.31.4"]
Member

Just to save compute resources, I think for now we can run this test on a single k8s version, since we run the rest of the E2E tests on all versions.
WDYT @Electronic-Waste ?

Member

Yeah, I agree. Maybe we can select one k8s version from this list:)

Contributor Author

OK, let me change the version to v1.30.6. And we can update it if needed in the future.

Member

Given that we support 1.28-1.31, I would suggest that we run our integration tests on 1.29, 1.30, and 1.31; we can update that in a follow-up PR.
For the train API tests, I think running on 1.31 should be sufficient.

Contributor Author

Sounds good. So I think we will still keep the v1.31.4 version.
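
For illustration only, the outcome of this thread (a separate workflow for the train API test, pinned to a single Kubernetes version) could be sketched as the GitHub Actions fragment below. This is not taken from the PR's diff: the workflow name, job id, and step names are hypothetical, and the final step is a placeholder that only prints the selected version.

```yaml
# Illustrative sketch; workflow name, job id, and step names are assumptions.
name: E2E Test with train API
on:
  - pull_request
jobs:
  e2e-train-api:
    runs-on: ubuntu-latest
    strategy:
      # Let every matrix entry finish even if another one fails.
      fail-fast: false
      matrix:
        # A single version keeps compute usage low; add entries to widen coverage.
        kubernetes-version: ["v1.31.4"]
    steps:
      - name: Check out code
        uses: actions/checkout@v4
      - name: Show the selected Kubernetes version
        run: echo "Testing against ${{ matrix.kubernetes-version }}"
```

Keeping the version in a one-element matrix, rather than hard-coding it in a step, means coverage can later be widened by editing a single list.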

Member

@andreyvelich andreyvelich left a comment

Thank you for this effort @helenxie-bit!
/lgtm
/approve
/hold cancel

@google-oss-prow

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: andreyvelich, tenzen-y

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [andreyvelich,tenzen-y]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@google-oss-prow google-oss-prow bot merged commit d7f69e8 into kubeflow:master Dec 22, 2024
55 checks passed
szaher pushed a commit to szaher/sdk that referenced this pull request Jun 4, 2025
* add e2e test for train API

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix peft import error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update settings of the job

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix error detection

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* resolve conflict

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* resolve conflict

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* resolve conflict

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix NoneType error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* test bug

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* find bug

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* find bug

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* find bug

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* add storage_config

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* reduce pvc size

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* set storage_config

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* set storage_config

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* set storage_config

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* set storage_config

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* use gpu

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* use gpu

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* use gpu

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix 'set_device' error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* add timeout error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix typo

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update e2e test for train api

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* add num_labels

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update pip install

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check disk space

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* change sequence of e2e tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* add clean-up after each e2e test of pytorchjob

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function-add check disk

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check docker volumes

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check docker directory

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update pip install and 'num_workers'

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update pip install and 'num_workers'

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update pip install

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* change the value of 'clean_pod_policy'

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* change the value of 'update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check docker volumes

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check docker volumes

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* stop the controller and restart it again to clean up

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update cleanup function

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* separate e2e test for train api

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix parameter of namespace

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* reduce resources

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* separate e2e test for train API

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* remove go setup

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* adjust the version of k8s

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* move test file to new place

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix typos

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* rerun tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update install packages

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* build and verify images of storage-intializer and trainer

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix image build error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix image build error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check disk space

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* make 'setup-storage-initializer-and-trainer' executable

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* separate step of loading images

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check disk space after loading image

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* clean up and check disk space

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* prune docker build cache

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* prune docker build cache

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* adjust sequence of building and loading images

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* move working directory

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* delete moving working directory

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* use 'docker system prune'

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* make the format of the commands to be consistent

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update base image

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update base image

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update base image

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* delete unnecessary space clear and check code

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* merge e2e test for train api into integration tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* check for timeout error

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix name of trainer image

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix env of building storage initializer image

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* clean format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* skip e2e test for train API when use scheduling

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* Update name of fileholder

Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
Signed-off-by: Hezhi (Helen) Xie <hezxie@ucdavis.edu>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* separate e2e test for train API

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* fix format

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* move test script

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update path to test script

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update path to test script

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* rerun tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* rerun tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* rerun tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update kubernetes version

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* update kubernetes version

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* rerun tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* rerun tests

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* adjust kubernetes version to 1.30.6

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

* adjust kubernetes version to 1.31.4

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>

---------

Signed-off-by: helenxie-bit <helenxiehz@gmail.com>
Signed-off-by: Hezhi (Helen) Xie <hezxie@ucdavis.edu>
Co-authored-by: Andrey Velichkevich <andrey.velichkevich@gmail.com>
szaher pushed a commit to szaher/sdk that referenced this pull request Jun 5, 2025
6 participants