v0.2.0

Latest

Latest

garrett4wade released this 31 Mar 00:50

· 26 commits to main since this release

de3f66a

Our milestone release, AReaL-boba 🎉

Features

Quickstart by default yaml config and commandline overrides. Check our updated tutorial!
Full SGLang support and other system optimizations for 1.5x faster RL training.
SOTA 7B math reasoning: 61.9 AIME24 & 48.3 AIME25
200-sample 32B tuning match QwQ on AIME24

We fully open-source all code, model, and data. Check our technical blog for more details!

Assets 2