    Repositories list

    • CheemsRM

      Public
      ACL'25: Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
      Python
      Updated Jun 10, 2025
    • Repo of the ACL'25 Findings paper "Critic-CoT: Boosting the Reasoning Abilities of Large Language Model via Chain-of-Thought Critic"
      Python
      Updated May 27, 2025
    • RLFH

      Public
      ACL'25 Findings: On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation
      Python
      Updated May 26, 2025
    • ICML'25: The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models
      Python
      Updated May 19, 2025