Skip to content

v0.3.0

Choose a tag to compare

@pan-x-c pan-x-c released this 09 Sep 10:14
· 42 commits to main since this release
7d2323f

Overview

Framework Development

Buffer Module

  1. Use Operator Interface to replace the original AddStrategy. Operator can perform various transformations on experience data in a pipeline manner. [Breaking Change]
  2. Add TaskPipeline and ExperiencePipeline for task and experience data preprocessing.
  3. Support calling Data-Juicer services in both TaskPipeline and ExperiencePipeline, and resolve some dependency conflicts.
  4. Refactor SQL/FILE storage. SQL can store SFT/DPO/Rollout/Experience data. SQL and FILE support parsing multi-turn SFT data with tools. [Breaking Change]

Trainer Module

  1. Support FSDP2 backend
  2. Support Megatron backend
  3. Support Qwen2.5 VL muti-modal models [Experimental]

Explorer Module

  1. Support Qwen2.5 VL multi-modal models [Experimental]
  2. Workflow supports running in async mode.
  3. ModelWrapper provides openai.AsyncOpenAI interface.

Utils Module

  1. Enhance logger and support printing logs of different actors to different files under the checkpoint dir
  2. Enhance wandb and mlflow monitor

New Algorithms

  1. AsymRE
  2. sPPO
  3. RULER
  4. TOPR and CISPO

New Workflows

  1. General Multi-turn Email Search

Others

  1. Support uv
  2. Refactor README and documents
  3. Fix many bugs

What's Changed

Full Changelog: v0.2.1...v0.3.0