Skip to content
Change the repository type filter

All

    Repositories list

    • VerIF

      Public
      [EMNLP 2025] Verification Engineering for RL in Instruction Following
      Python
      03024Updated Aug 4, 2025Aug 4, 2025
    • 0110Updated Jul 23, 2025Jul 23, 2025
    • RM-Bench

      Public
      [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
      Python
      25930Updated Jul 18, 2025Jul 18, 2025
    • OpenSAE

      Public
      Python
      12800Updated Jul 17, 2025Jul 17, 2025
    • Python
      21110Updated Jun 25, 2025Jun 25, 2025
    • Python
      21200Updated Jun 18, 2025Jun 18, 2025
    • LLMAEL

      Public
      [ACL Workshop 2025] LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
      Python
      11210Updated Jun 16, 2025Jun 16, 2025
    • [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
      Python
      510200Updated Jun 11, 2025Jun 11, 2025
    • Python
      1214620Updated May 28, 2025May 28, 2025
    • AtomR

      Public
      [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
      Jupyter Notebook
      21200Updated May 27, 2025May 27, 2025
    • MMGeoLM

      Public
      Python
      0610Updated May 27, 2025May 27, 2025
    • Crab

      Public
      Constraint Back-translation Improves Complex Instruction Following of Large Language Models
      Python
      01400Updated May 23, 2025May 23, 2025
    • AgentIF

      Public
      AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
      Python
      01711Updated May 23, 2025May 23, 2025
    • ReaRAG

      Public
      ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation
      Python
      21800Updated May 8, 2025May 8, 2025
    • Python
      01100Updated Apr 14, 2025Apr 14, 2025
    • [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
      Python
      01900Updated Mar 29, 2025Mar 29, 2025
    • Python
      1100Updated Mar 20, 2025Mar 20, 2025
    • MRCEval

      Public
      MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark
      Python
      0400Updated Mar 12, 2025Mar 12, 2025
    • OmniEvent

      Public
      A comprehensive, unified and modular event extraction toolkit.
      Python
      3738884Updated Dec 18, 2024Dec 18, 2024
    • ADELIE

      Public
      [EMNLP2024] Aligning Large Language Models on Information Extraction
      Python
      25310Updated Nov 4, 2024Nov 4, 2024
    • KB-Plugin

      Public
      [EMNLP2024] KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases
      Python
      1900Updated Oct 16, 2024Oct 16, 2024
    • The data and source code for the paper "MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs"
      Python
      24660Updated Oct 7, 2024Oct 7, 2024
    • DICE

      Public
      DICE: Detecting In-distribution Data Contamination with LLM's Internal State
      Python
      0900Updated Sep 21, 2024Sep 21, 2024
    • Data and code for the paper: Finding Safety Neurons in Large Language Models
      Jupyter Notebook
      0720Updated Sep 21, 2024Sep 21, 2024
    • Papers on LLM Reasoning and Retrieval-Augmented LLM Reasoning
      0700Updated Aug 27, 2024Aug 27, 2024
    • DiaKoP

      Public
      DiaKoP (CIKM Demo 2024)
      JavaScript
      0400Updated Aug 7, 2024Aug 7, 2024
    • Python
      0500Updated Jul 22, 2024Jul 22, 2024
    • Data and Code for the paper, Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack.
      Python
      3800Updated Jun 28, 2024Jun 28, 2024
    • SeaKR

      Public
      Python
      63040Updated Jun 26, 2024Jun 26, 2024
    • ARTE

      Public
      0400Updated Jun 24, 2024Jun 24, 2024