Skip to content
Change the repository type filter

All

    Repositories list

    • EgoMask

      Public
      [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"
      Python
      0700Updated Aug 4, 2025Aug 4, 2025
    • VG-LLM

      Public
      The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
      Jupyter Notebook
      210260Updated Aug 3, 2025Aug 3, 2025
    • AIM

      Public
      [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
      Python
      23710Updated Jun 26, 2025Jun 26, 2025
    • [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.
      Python
      1014350Updated Jun 4, 2025Jun 4, 2025
    • JavaScript
      0000Updated Jun 2, 2025Jun 2, 2025
    • C2LEVA

      Public
      [Findings of ACL 2025] "C2LEVA: Toward Comprehensive and Contamination-Free Language Model Evaluation"
      0200Updated May 27, 2025May 27, 2025
    • CLEVA

      Public
      [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform"
      Shell
      36310Updated May 16, 2025May 16, 2025
    • FTTT

      Public
      [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.
      Python
      01100Updated May 16, 2025May 16, 2025
    • [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"
      Python
      12000Updated Oct 17, 2024Oct 17, 2024
    • TG-Vid

      Public
      [EMNLP 2024] Official code for "Enhancing Temporal Modeling of Video LLMs via Time Gating"
      Python
      0600Updated Oct 10, 2024Oct 10, 2024
    • [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners
      Python
      01600Updated May 28, 2024May 28, 2024
    • CSS
      0000Updated Apr 11, 2024Apr 11, 2024
    • NaviLLM

      Public
      [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
      Python
      144600Updated Apr 11, 2024Apr 11, 2024
    • MVT-3DVG

      Public
      [CVPR 2022] Multi-View Transformer for 3D Visual Grounding
      C++
      5100Updated Nov 9, 2022Nov 9, 2022