Skip to content

Releases: NVIDIA/cloudai

v1.3.beta21

11 Jun 15:57
d6eb739
Compare
Choose a tag to compare
v1.3.beta21 Pre-release
Pre-release

What's Changed

Full Changelog: v1.3.beta20...v1.3.beta21

v1.3.beta20

06 Jun 10:06
a4d99ca
Compare
Choose a tag to compare
v1.3.beta20 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: v1.3.beta19...v1.3.beta20

v1.3.beta19

05 Jun 10:35
8953247
Compare
Choose a tag to compare
v1.3.beta19 Pre-release
Pre-release

What's Changed

  • Refactor supports_gpu_directives to focus on GresTypes by @TaekyungHeo in #556

Full Changelog: v1.3.beta18...v1.3.beta19

v1.3.beta18

04 Jun 15:00
2f23f0d
Compare
Choose a tag to compare
v1.3.beta18 Pre-release
Pre-release

What's Changed

  • Allow sweeps for number of nodes by @amaslenn in #487
  • Fix invalid type for image when cache is disabled by @amaslenn in #554
  • BaseRunner: rename callbacks and make them synchronous by @amaslenn in #553

Full Changelog: v1.3.beta17...v1.3.beta18

v1.3.beta17

03 Jun 17:00
4edf281
Compare
Choose a tag to compare
v1.3.beta17 Pre-release
Pre-release

What's Changed

  • Make sure install status is populated to all duplicates by @amaslenn in #545
  • Use copies for venv creation + fix tests by @amaslenn in #546
  • Control if home folder should be mounted into container for slurm by @amaslenn in #547
  • Add LLAMA3 8b to NeMo acceptance by @TaekyungHeo in #532
  • Return absolute path for cached Docker image in installed_path method by @TaekyungHeo in #549
  • Allow val_check_interval to be int, float, or list of both by @amaslenn in #551
  • Support single node configuration for NIXLBench by @amaslenn in #552
  • Make sure mark_as_installed respects system config by @amaslenn in #548
  • NIXL reporting by @amaslenn in #550

Full Changelog: v1.3.beta16...v1.3.beta17

v1.3.beta16

28 May 15:04
d947e02
Compare
Choose a tag to compare
v1.3.beta16 Pre-release
Pre-release

What's Changed

Full Changelog: v1.3.beta15...v1.3.beta16

v1.3.beta15

27 May 16:26
c051a27
Compare
Choose a tag to compare
v1.3.beta15 Pre-release
Pre-release

What's Changed

  • Added support for additional args in cmd_args in chakra replay workload by @Eli-Siegel-nvidia in #542
  • Add GPU directive support check to SlurmSystem and use it in command gen by @TaekyungHeo in #541

New Contributors

Full Changelog: v1.3.beta14...v1.3.beta15

v1.3.beta14

23 May 16:34
f87f8df
Compare
Choose a tag to compare
v1.3.beta14 Pre-release
Pre-release

What's Changed

  • Set srun job name to "-CloudAI_install_docker_image.%Y%m%d_%H%M%S" by @TaekyungHeo in #544

Full Changelog: v1.3.beta13...v1.3.beta14

v1.3.beta13

23 May 10:25
09dff16
Compare
Choose a tag to compare
v1.3.beta13 Pre-release
Pre-release

What's Changed

Full Changelog: v1.3.beta12...v1.3.beta13

v1.3.beta12

21 May 14:35
37f6f1b
Compare
Choose a tag to compare
v1.3.beta12 Pre-release
Pre-release

What's Changed

Full Changelog: v1.3.beta11...v1.3.beta12