
✨ LeNet BatchEnsemble and Deep Ensemble + minor bugfixes #137


Merged: 17 commits merged into ENSTA-U2IS-AI:dev on Mar 10, 2025

Conversation

tonyzamyatin
Contributor

This PR introduces the following improvements and fixes:

  • Bug Fixes: Address minor issues in some of the existing LeNet configs.
  • BatchEnsemble Wrapper: Implements a wrapper that replicates the input batch num_estimators times, eliminating the need to hardcode this for each model (see the sketch after this list).
  • Documentation Updates: Enhances the documentation to reflect the usage of batch ensemble layers and the classification routine, aligning with available wrappers and transforms.
  • LeNet BatchEnsemble Implementation: Adds BatchEnsemble support for LeNet, along with new experiment configurations for both Deep Ensemble and BatchEnsemble models (tested manually).
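As an illustration of the wrapper idea above, here is a minimal sketch. The class name `RepeatInputWrapper` and its exact behaviour are assumptions for illustration, not the merged implementation:

```python
import torch
from torch import nn


class RepeatInputWrapper(nn.Module):
    """Hypothetical sketch: repeat the input batch so that each of the
    num_estimators BatchEnsemble members sees a copy of every sample."""

    def __init__(self, model: nn.Module, num_estimators: int) -> None:
        super().__init__()
        self.model = model
        self.num_estimators = num_estimators

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # (B, ...) -> (num_estimators * B, ...): the ensemble members are
        # laid out along the batch dimension.
        x = x.repeat(self.num_estimators, *([1] * (x.dim() - 1)))
        return self.model(x)
```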

@o-laurent changed the base branch from main to dev on March 7, 2025, 12:40
@o-laurent added the bug (Something isn't working) and enhancement (New feature or request) labels on Mar 7, 2025
@o-laurent self-requested a review on March 7, 2025, 12:41
@o-laurent
Contributor

Hello again @tonyzamyatin,

We have long wanted a BatchEnsemble wrapper, so thank you for this. Before I look at your PR in detail, don't hesitate to enable the pre-commit hooks as detailed in the contributing page. By the way, could you enable them directly in your Dockerfile in #134?

@tonyzamyatin
Contributor Author

Hi @o-laurent,

Currently, I have a fork where I experiment, and whenever I implement something new or fix a bug, I create a new branch and open a PR.

I have the pre-commit checks enabled on my fork and manually run the hooks before I make commits.

Is there any problem with this, or a way to automate it further? I'm just wondering since you're asking.

And yeah I can add that to the Dockerfile👌

@o-laurent
Contributor

o-laurent commented Mar 7, 2025

No worries, I'm saying this to avoid failing GitHub workflows due to linting issues as in test #1601.

If the docker image is made for contributions, then I think that it makes sense to enforce that the pre-commit hooks are installed. When installed, they should run automatically at commit time - and not manually - unless you use the --no-verify flag.

@tonyzamyatin
Contributor Author

Yeah, you're right. It definitely makes sense to enable pre-commit hooks in the Dockerfile.

Hmmm when I commit something onto the fork the pre-commit hook doesn't automatically run, only the pre-commit checks.🧐

@o-laurent
Contributor

o-laurent commented Mar 7, 2025

Hmmm when I commit something onto the fork the pre-commit hook doesn't automatically run, only the pre-commit checks.🧐

What do you mean by pre-commit check (vs. pre-commit hooks)?

Normally, pre-commit should make committing impossible when any of the hooks fail, in this case "ruff-check".

@tonyzamyatin
Contributor Author

tonyzamyatin commented Mar 7, 2025

Sorry for being unclear.

The commits get rejected if a pre-commit hook doesn't run successfully, but the code isn't automatically formatted on commit. I have to run the pre-commit hooks manually and then commit again.

@o-laurent
Contributor

On my side, I just have to add the modified files and commit again. Anyway, this doesn't explain how you managed to push code that made our test fail 😄 but it's no big deal at all.

@tonyzamyatin
Contributor Author

Exactly, I do the same.

Don't know how this happens lol 😂

@tonyzamyatin
Contributor Author

tonyzamyatin commented Mar 7, 2025

Although usually it's some test for a not-yet-implemented transform that makes the pipeline fail.

This time I may have forgotten to lint 🌚

I'll have to double-check whether the pre-commit hooks are active in the Docker container by default.

@o-laurent
Contributor

Damn, I didn't install the pre-commits on my clone of your fork and now I'm pushing failing code too 🤣

@tonyzamyatin
Contributor Author

HAHAHHA🤣


codecov bot commented Mar 7, 2025

Codecov Report

Attention: Patch coverage is 98.36066% with 1 line in your changes missing coverage. Please review.

Project coverage is 99.15%. Comparing base (7d0d345) to head (c737d07).
Report is 62 commits behind head on dev.

✅ All tests successful. No failed tests found.

Files with missing lines Patch % Lines
...orch_uncertainty/models/wrappers/batch_ensemble.py 97.22% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##              dev     #137      +/-   ##
==========================================
- Coverage   99.15%   99.15%   -0.01%     
==========================================
  Files         143      144       +1     
  Lines        7136     7192      +56     
  Branches      910      923      +13     
==========================================
+ Hits         7076     7131      +55     
- Misses         27       28       +1     
  Partials       33       33              
Flag Coverage Δ
cpu 99.15% <98.36%> (-0.01%) ⬇️
pytest 99.15% <98.36%> (-0.01%) ⬇️

Flags with carried forward coverage won't be shown.


Contributor

@o-laurent left a comment


Thank you for your PR! Here are a few quick comments on some details.

@alafage
Contributor

alafage commented Mar 10, 2025

Hi @tonyzamyatin,

Nice PR, thank you! I don't know if you saw my answer on Discord, but there are some points that we need to address before merging these changes to dev.

First, we would like to keep the possibility of not repeating the batch during training to fit the original paper better. In this case, we do not need to use format_batch_fn=RepeatTarget(num_repeats=num_estimators) in the ClassificationRoutine().
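For context, a rough sketch of the two configurations being discussed; the import paths and argument names are assumptions about the torch-uncertainty API rather than a verbatim excerpt:

```python
from torch import nn
from torch_uncertainty.routines import ClassificationRoutine
from torch_uncertainty.transforms import RepeatTarget

num_estimators = 4
model = nn.Linear(32, 10)  # stand-in for a model containing BatchEnsemble layers
loss = nn.CrossEntropyLoss()

# Repeated-batch training: targets must be repeated to match the repeated inputs.
routine_repeated = ClassificationRoutine(
    model=model,
    num_classes=10,
    loss=loss,
    format_batch_fn=RepeatTarget(num_repeats=num_estimators),
)

# Original-paper behaviour: the batch is not repeated during training,
# so no RepeatTarget is needed.
routine_plain = ClassificationRoutine(model=model, num_classes=10, loss=loss)
```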

Secondly, @o-laurent and I agree that the current BatchEnsemble wrapper can be misleading. It does not make a BatchEnsemble but overrides the model's forward to repeat the inputs first. It would be clearer if we had a wrapper taking any nn.Module and dynamically replacing the layers with their BatchEnsemble layer counterparts.

I will make these changes directly on your fork if that's okay.

@tonyzamyatin
Contributor Author

Hi @alafage,

That makes total sense!

First, we would like to keep the possibility of not repeating the batch during training to fit the original paper better. In this case, we do not need to use format_batch_fn=RepeatTarget(num_repeats=num_estimators) in the ClassificationRoutine().

For this, I think we could simply pass a boolean flag via the __init__() constructor to indicate whether to replicate the targets for training or not.

Secondly, @o-laurent and I agree that the current BatchEnsemble wrapper can be misleading. It does not make a BatchEnsemble but overrides the model's forward to repeat the inputs first. It would be clearer if we had a wrapper taking any nn.Module and dynamically replacing the layers with their BatchEnsemble layer counterparts.

This also makes a lot of sense, but note that it introduces additional complexity, could lead to silent failures if there is no such implementation, and could cause unintended behaviour (e.g. using a pretrained model with the wrapper). Maybe it's more explicit to implement BatchEnsemble constructors similar to lenet_batchensemble() in lenet.py?
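For illustration, such an explicit constructor could look roughly like this; the layer sizes and the exact BatchConv2d / BatchLinear signatures are assumptions for the sketch, not the code in lenet.py:

```python
from torch import nn
from torch_uncertainty.layers import BatchConv2d, BatchLinear  # assumed import path


def lenet_batchensemble(in_channels: int, num_classes: int, num_estimators: int) -> nn.Module:
    """Sketch of a LeNet built explicitly from BatchEnsemble layers."""
    return nn.Sequential(
        BatchConv2d(in_channels, 6, kernel_size=5, num_estimators=num_estimators),
        nn.ReLU(),
        nn.MaxPool2d(2),
        BatchConv2d(6, 16, kernel_size=5, num_estimators=num_estimators),
        nn.ReLU(),
        nn.MaxPool2d(2),
        nn.Flatten(),
        BatchLinear(16 * 5 * 5, 120, num_estimators=num_estimators),
        nn.ReLU(),
        BatchLinear(120, 84, num_estimators=num_estimators),
        nn.ReLU(),
        BatchLinear(84, num_classes, num_estimators=num_estimators),
    )
```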

In any case, I believe it makes sense to add a note in the docstring that the architecture must use BatchEnsemble layers.

What do you think?

@tonyzamyatin
Contributor Author

All feedback has been implemented apart from this detail, but I can also make this change if you want :)

@alafage
Contributor

alafage commented Mar 10, 2025

I added a boolean repeat_training_inputs in torch_uncertainty.wrappers.BatchEnsemble.__init__() to control whether to repeat the inputs during training. The inputs are always repeated for evaluation.

Concerning the second point, although it adds more complexity, I have added another boolean parameter to the class: convert_layers. If True, all nn.Linear and nn.Conv2d layers will be replaced by BatchLinear and BatchConv2d layers. To ensure the wrapper is used correctly, we check whether there is at least one BatchLinear or BatchConv2d layer in the model; if not, an error is raised. I took the liberty of updating batchensemble_lenet accordingly (see the sketch below).
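Roughly, the convert_layers behaviour described above could be sketched as follows. This is a simplified illustration; the BatchLinear / BatchConv2d constructor signatures are assumptions, and the merged code may differ:

```python
from torch import nn
from torch_uncertainty.layers import BatchConv2d, BatchLinear  # assumed import path


def _convert_layers(module: nn.Module, num_estimators: int) -> bool:
    """Recursively replace nn.Linear / nn.Conv2d children with their
    BatchEnsemble counterparts; return True if anything was converted."""
    converted = False
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            setattr(
                module,
                name,
                BatchLinear(child.in_features, child.out_features, num_estimators=num_estimators),
            )
            converted = True
        elif isinstance(child, nn.Conv2d):
            setattr(
                module,
                name,
                BatchConv2d(
                    child.in_channels,
                    child.out_channels,
                    child.kernel_size,
                    num_estimators=num_estimators,
                ),
            )
            converted = True
        else:
            converted = _convert_layers(child, num_estimators) or converted
    return converted


def _check_batch_ensemble_layers(model: nn.Module) -> None:
    """The wrapper's sanity check: fail loudly if the (possibly converted)
    model still contains no BatchEnsemble layer at all."""
    if not any(isinstance(m, (BatchLinear, BatchConv2d)) for m in model.modules()):
        raise ValueError("The model does not contain any BatchEnsemble layers.")
```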

Although I've written tests to validate my modifications, feel free to check that they do not break anything for your experiments.

Cheers!

Contributor

@o-laurent left a comment


Seems perfect! Thanks @tonyzamyatin and @alafage!

@alafage merged commit 819f7e0 into ENSTA-U2IS-AI:dev on Mar 10, 2025
1 check passed