Skip to content

v2.4.0

Latest

Choose a tag to compare

@shibing624 shibing624 released this 17 Feb 03:18
· 71 commits to main since this release

v2.4.0

  1. 新增GRPO训练方法,GRPO通过纯RL方法可以体验aha momenthttps://github.com/shibing624/MedicalGPT/blob/main/run_grpo.sh
  2. 支持了 DeepSeek-V3, DeepSeek-R1 模型, template_name=deepseek3