An Enhancement Proposal Workflow #1674
Conversation
Codecov Report ✅ All modified and coverable lines are covered by tests.

Coverage Diff:

|          | main   | #1674  | +/-    |
|----------|--------|--------|--------|
| Coverage | 87.77% | 84.59% | -3.19% |
| Files    | 134    | 137    | +3     |
| Lines    | 11126  | 11417  | +291   |
| Hits     | 9766   | 9658   | -108   |
| Misses   | 1360   | 1759   | +399   |

Flags with carried forward coverage won't be shown.
Thanks!
The general idea of EPs is great, I really like it! We should also advertise this in `contributing.md` (or even require it for "external" PRs).
I do have doubts about the new abstractions, though. For example, we currently have `EarlyStopping`. In the future, we would probably also have `LRScheduler`, `ClippingMethod`, and possibly more. Should we really aim to build the ultimate training loop? I think this has a few downsides:
- all of these features have to be well-documented (otherwise they will forever go unused)
- they create more abstractions, which will make the package harder for new maintainers to learn
- more testing...
The alternative would be to simply point people to the flexible training loop. Everybody can use an LLM to get, for example, a learning rate scheduler. Of course, this requires a bit more work from users, but getting to know yet another `sbi` abstraction (which includes reading and understanding the documentation, testing the feature, making sure it works, ...) is also non-zero work for users.
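For illustration, a rough sketch of that alternative: wiring a standard PyTorch scheduler into a hand-rolled flexible loop instead of learning a new `sbi` abstraction. All names here (the toy network, data, and loss) are placeholders, not the actual `sbi` API.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Toy stand-ins; in practice these come from the user's own sbi setup.
network = nn.Sequential(nn.Linear(2, 16), nn.ReLU(), nn.Linear(16, 1))
theta, x = torch.randn(256, 1), torch.randn(256, 1)
train_loader = DataLoader(TensorDataset(theta, x), batch_size=32)

optimizer = torch.optim.Adam(network.parameters(), lr=5e-4)
# Plain PyTorch scheduler: reduces the LR when the monitored loss plateaus.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, patience=10)

for epoch in range(100):
    for theta_b, x_b in train_loader:
        optimizer.zero_grad()
        loss = ((network(torch.cat([theta_b, x_b], dim=1)) - theta_b) ** 2).mean()
        loss.backward()
        optimizer.step()
    # In a real loop this would be a proper validation loss; the last
    # training-batch loss stands in as a placeholder here.
    scheduler.step(loss.item())
```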
### For Maintainers

- **Reduced code duplication**: Extract shared training logic into reusable components
I am fine with doing this for this PR, but in general, I don't think that code duplication is always a bad thing. Introducing "reusable components" adds cognitive load and will make the package harder to maintain in the future.
- **Seamless experiment tracking**: Support for TensorBoard, WandB, MLflow, and stdout without changing existing code
- **Multiple early stopping strategies**: Patience-based, plateau detection, and custom
Is there any evidence that patience-based early stopping is not working well? I would only add more methods if they are really low-effort to implement and maintain, or if they are really important for performance.
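For context, the patience-based baseline is only a handful of lines; a generic sketch (not the actual `sbi` implementation):

```python
def train_with_patience(step, validate, patience=20, min_delta=1e-4, max_epochs=1000):
    """Run `step()` once per epoch; stop when the validation loss returned
    by `validate()` has not improved by `min_delta` for `patience` epochs."""
    best, stale_epochs = float("inf"), 0
    for _ in range(max_epochs):
        step()
        val_loss = validate()
        if val_loss < best - min_delta:
            best, stale_epochs = val_loss, 0
        else:
            stale_epochs += 1
        if stale_epochs >= patience:
            break  # no improvement for `patience` epochs
    return best
```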
learning_rate=5e-4,
max_epochs=1000,
device="cuda"
)
I don't really see the point of this. All of our train config args are ints, floats, or bools. Why do we need this additional abstraction?
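To make that concrete, a plain flat structure would already cover these values; a hypothetical sketch of the simpler option (names are illustrative, not the proposed API):

```python
from dataclasses import dataclass

# Hypothetical flat config: since the values are just ints, floats, bools,
# and device strings, a plain dataclass needs no extra abstraction layer.
@dataclass
class TrainConfig:
    learning_rate: float = 5e-4
    max_epochs: int = 1000
    device: str = "cuda"

config = TrainConfig(max_epochs=500)  # override only what differs
```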
)

# Method-specific loss configuration
loss_args = LossArgsNPE(exclude_invalid_x=True)
I would not consider `exclude_invalid_x` to belong in `LossArgsNPE`.
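One way to read this: excluding invalid simulations is a data-level operation, independent of any particular loss, so it could happen before training entirely. A generic sketch (the filtering logic is illustrative, not the `sbi` implementation):

```python
import torch

theta = torch.randn(100, 3)
x = torch.randn(100, 2)
x[::7] = float("nan")  # pretend some simulations failed

# Dropping invalid rows is plain data preprocessing, not a loss argument.
valid = torch.isfinite(x).all(dim=1)
theta, x = theta[valid], x[valid]
```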
early_stop = EarlyStopping.validation_loss(patience=20, min_delta=1e-4)

# Stop when learning rate drops too low
early_stop = EarlyStopping.lr_threshold(min_lr=1e-6)
Why is this not part of `train_config`?
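I.e., the threshold could be just another scalar field on the flat training config instead of a dedicated factory; a hypothetical sketch:

```python
# Hypothetical: fold the LR threshold into the flat training config rather
# than a separate EarlyStopping.lr_threshold(...) constructor.
train_config = {"learning_rate": 5e-4, "max_epochs": 1000, "min_lr": 1e-6}

def should_stop(optimizer, config) -> bool:
    # Stop once every parameter group's learning rate falls below min_lr.
    return all(g["lr"] < config["min_lr"] for g in optimizer.param_groups)
```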
I suggest we adopt an Enhancement Proposal (EP) workflow to better organize and discuss larger refactorings and API changes. EPs should live on the docs website and have a corresponding GitHub Discussions entry for public discussion.
This will help us plan and reach consensus on major changes (e.g., the roadmap toward `sbi` 1.0, deprecations, architectural refactors), while keeping the process transparent and easy to follow. I created a corresponding entry on the website, an explanation of the workflow, and an example EP-01 for the planned training infrastructure refactoring (numfocus/small-development-grant-proposals#60).
In general, I see this as a first step towards a more transparent and professional governance model.
This is all open for discussion; let's discuss it in next week's org meeting.