Skip to content

Conversation

CharlelieLrt
Copy link
Collaborator

PhysicsNeMo Pull Request

Description

To improve stability of CorrDiffr, this PR introduces non-regression for the end-to-end workflow of CorrDiff train.py and generate.py. The goal is to have non-regression tests that combine:

  • lead-time aware models
  • regression and diffusion architectures representative of an actual CorrDiff application
  • patching
  • latest performance optimizations (AMP< compiling, Apex GN, etc...)

More specifically, the PR introduces:

  • Regression and diffusion model checkpoints, generated with release v1.0.1
  • Reference loss data for CorrDiff training, generated with release v1.1.1
  • Non-regression test for CorrDiff regression/diffusion training, which ensures that results from current model implementation do not deviate from that obtained with checkpoints from v1.0.1
  • Non-regression tests for CorrDiff generation using both deterministic/stochastic sampler with model checkpoints and current APIs

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • The CHANGELOG.md is up to date with these changes.
  • An issue is linked to this pull request.

Dependencies

CharlelieLrt and others added 3 commits July 16, 2025 20:47
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
@CharlelieLrt CharlelieLrt self-assigned this Jul 16, 2025
@CharlelieLrt CharlelieLrt added 2 - In Progress Currently a work in progress Earth-2 labels Jul 16, 2025
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Signed-off-by: Charlelie Laurent <claurent@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2 - In Progress Currently a work in progress Earth-2

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant