Skip to content

Popular repositories Loading

  1. Cherry_LLM Cherry_LLM Public

    [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

    Python 398 26

  2. Reflection_Tuning Reflection_Tuning Public

    [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

    Python 363 30

  3. HallusionBench HallusionBench Public

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Python 305 8

  4. Superfiltering Superfiltering Public

    [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    Python 179 14

  5. MoE-Embedding MoE-Embedding Public

    [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"

    Python 82 11

  6. MiP-Overthinking MiP-Overthinking Public

    [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

    Python 35 1

Repositories

Showing 10 of 18 repositories
  • HallusionBench Public

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    tianyi-lab/HallusionBench’s past year of commit activity
    Python 305 BSD-3-Clause 8 0 0 Updated Oct 14, 2025
  • RuleR Public

    [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling

    tianyi-lab/RuleR’s past year of commit activity
    Python 14 1 1 0 Updated Sep 27, 2025
  • Mosaic-IT Public

    [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning

    tianyi-lab/Mosaic-IT’s past year of commit activity
    Python 20 3 0 0 Updated Sep 27, 2025
  • ColorBench Public

    [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

    tianyi-lab/ColorBench’s past year of commit activity
    Python 27 Apache-2.0 0 0 0 Updated Sep 27, 2025
  • DisCL Public

    [ICCV 2025] Diffusion Curriculum (DisCL)

    tianyi-lab/DisCL’s past year of commit activity
    Jupyter Notebook 13 0 2 0 Updated Sep 26, 2025
  • FaSTAR Public

    Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

    tianyi-lab/FaSTAR’s past year of commit activity
    Jupyter Notebook 28 BSD-3-Clause 2 0 0 Updated Jun 27, 2025
  • Superfiltering Public

    [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    tianyi-lab/Superfiltering’s past year of commit activity
    Python 179 14 0 0 Updated Jun 25, 2025
  • Cherry_LLM Public

    [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

    tianyi-lab/Cherry_LLM’s past year of commit activity
    Python 398 26 1 0 Updated Jun 25, 2025
  • MiP-Overthinking Public

    [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

    tianyi-lab/MiP-Overthinking’s past year of commit activity
    Python 35 MIT 1 1 0 Updated Jun 5, 2025
  • C3PO Public

    [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"

    tianyi-lab/C3PO’s past year of commit activity
    Jupyter Notebook 18 Apache-2.0 1 1 0 Updated Apr 9, 2025

Most used topics

Loading…