Release v1.2.2

@BeachWang BeachWang released this 14 Mar 09:58
· 39 commits to main since this release
8d09410

Major Updates

  • 🧪 Add documentation for the API service. Parameters are now passed as strings serialized with json.dumps, which supports API calls to arbitrary registered functions and classes. #613
  • 🚀 Add unit tests for the analysis module and utils module to increase test coverage. #604 #616
  • 🆕 A new data synthesis method is proposed that encourages LLMs to self-generate challenging cognitive questions. It achieves better data efficiency, cross-modality generalization, and SFT results than SOTA baselines (e.g., a 16% gain on MathVision using only 400 samples). See more details in MindGym: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions.
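
The json.dumps-based parameter passing mentioned in the first item above can be illustrated with a minimal sketch. It is not the project's actual API service code: the helper names `encode_params` and `decode_params` and the example parameter names are hypothetical. It only shows why serializing each value with json.dumps lets lists, dicts, numbers, and None survive a query string.

```python
import json
from urllib.parse import parse_qsl, urlencode


def encode_params(params):
    """Serialize every value with json.dumps, then URL-encode the result.

    Hypothetical helper: query strings carry only text, so each value is
    turned into a JSON string before it is sent.
    """
    return urlencode({key: json.dumps(value) for key, value in params.items()})


def decode_params(query):
    """Reverse encode_params: parse the query string and json.loads each value."""
    return {key: json.loads(value) for key, value in parse_qsl(query)}


# Example parameters (names are illustrative only).
params = {
    "text_key": "text",
    "min_score": 0.5,
    "langs": ["en", "zh"],
    "opts": {"strict": True},
}

query = encode_params(params)
restored = decode_params(query)
```

Because every value round-trips through JSON, the receiving side can rebuild the original Python types before calling the target function or class.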

New OPs

  • llm_quality_score_filter: Filter that keeps samples with a high quality score estimated by an LLM. Supports both API calls and local vLLM calls. #606 #614 #620
  • llm_difficulty_score_filter: Filter that keeps samples with a high difficulty score estimated by an LLM. Supports both API calls and local vLLM calls. #606 #614 #620
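
The general idea behind both filters, keeping samples whose estimated score clears a threshold, can be sketched as follows. This is an illustration under stated assumptions, not the OPs' implementation: `filter_by_score`, `toy_scorer`, and the `min_score` parameter are hypothetical names, and the scorer is a stand-in for a real LLM call made through an API or a local vLLM instance.

```python
def filter_by_score(samples, score_fn, min_score=0.5):
    """Keep only the samples whose score is at least min_score.

    score_fn is any callable that maps a sample to a float; in practice it
    would wrap an LLM call (assumption for this sketch).
    """
    return [sample for sample in samples if score_fn(sample) >= min_score]


def toy_scorer(sample):
    """Stand-in for an LLM scorer: more words give a higher score, capped at 1.0."""
    return min(len(sample["text"].split()) / 10, 1.0)


samples = [
    {"text": "short"},
    {"text": "a much longer and more detailed piece of text with many words"},
]

kept = filter_by_score(samples, toy_scorer, min_score=0.5)
```

Passing the scorer in as a callable keeps the filtering logic the same whether scores come from a remote API or a local model.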

Others

  • Fix config in the LLaVA pretrain recipe. #610
  • Update news for MindGYM and fix doc. #615
  • Fix a decode error by decoding as UTF-8. #618