- Add filter interface
- Add Formatron support (JSON, KBNF grammars, etc.)
- Support Ernie 4.5 architecture
- Support SmolLM3 architecture
- Support Exaone4 architecture
- Improve Cohere2 support (fixes Command-A)
- Fix compatibility with certain Pixtral preprocessors
- Fix excessive virtual memory usage with large generator queues on Windows
- Add facilities for manually mixing/optimizing quants
- Add MMLU eval script
- Various other fixes, improvements and optimizations

Full Changelog: v0.0.4...v0.0.5