Our milestone release, AReaL-boba ๐
Features
- Quickstart by default yaml config and commandline overrides. Check our updated tutorial!
- Full SGLang support and other system optimizations for 1.5x faster RL training.
- SOTA 7B math reasoning: 61.9 AIME24 & 48.3 AIME25
- 200-sample 32B tuning match QwQ on AIME24
We fully open-source all code, model, and data. Check our technical blog for more details!