Skip to content

0.0.5

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 19 Jul 02:05
  • Add filter interface
  • Add Formatron support (JSON/KBNF grammar etc.)
  • Support Ernie 4.5 architecture
  • Support SmolLM3 architecture
  • Support Exaone4 architecture
  • Improve Cohere2 support (fixes support for Command-A)
  • Fix compatibility with certain Pixtral preprocessors
  • Fix excessive virtual memory usage with large generator queues on Windows
  • Add facilities for manually mixing/optimizing quants
  • Add MMLU eval script
  • Various other fixes, improvements and optimizations

Full Changelog: v0.0.4...v0.0.5