Skip to content
Change the repository type filter

All

    Repositories list

    • DDT

      Public
      DDT: Decoupled Diffusion Transformer
      Python
      1527120Updated Aug 22, 2025Aug 22, 2025
    • MOTIP

      Public
      [CVPR 2025] Multiple Object Tracking as ID Prediction
      Python
      2133260Updated Aug 20, 2025Aug 20, 2025
    • PixNerd

      Public
      Python
      28620Updated Aug 17, 2025Aug 17, 2025
    • VideoEval

      Public
      VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
      Python
      01200Updated Jul 31, 2025Jul 31, 2025
    • Video-DC

      Public
      Python
      11110Updated Jul 30, 2025Jul 30, 2025
    • CaReBench

      Public
      A Fine-grained Benchmark for Video Captioning and Retrieval
      Python
      12020Updated Jul 16, 2025Jul 16, 2025
    • [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online
      Python
      35970Updated Jul 9, 2025Jul 9, 2025
    • [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling
      Python
      02200Updated Jul 7, 2025Jul 7, 2025
    • p-MoD

      Public
      [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
      Python
      24210Updated Jun 26, 2025Jun 26, 2025
    • DEQDet

      Public
      [ICCV 2023] Deep Equilibrium Object Detection
      Jupyter Notebook
      12510Updated Jun 18, 2025Jun 18, 2025
    • SORCE

      Public
      Small Object Retrieval in Complex Environments (SORCE)
      Python
      1400Updated Jun 2, 2025Jun 2, 2025
    • DMM

      Public
      DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
      Python
      44520Updated Apr 27, 2025Apr 27, 2025
    • [TPAMI] JointFormer: A Unified Framework with Joint Modeling for Video Object Segmentation
      Python
      0900Updated Apr 16, 2025Apr 16, 2025
    • Tra-MoE

      Public
      [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
      Python
      24300Updated Apr 1, 2025Apr 1, 2025
    • TPM

      Public
      [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation
      Python
      02200Updated Mar 28, 2025Mar 28, 2025
    • MoG_Web

      Public
      JavaScript
      0000Updated Mar 11, 2025Mar 11, 2025
    • MoG-VFI

      Public
      Motion-Aware Generative Frame Interpolation
      Python
      33430Updated Mar 11, 2025Mar 11, 2025
    • HTML
      0100Updated Jan 13, 2025Jan 13, 2025
    • FlowDCN

      Public
      [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
      Python
      13400Updated Dec 23, 2024Dec 23, 2024
    • SPLAM

      Public
      [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
      Python
      12110Updated Nov 1, 2024Nov 1, 2024
    • AWT

      Public
      [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
      Python
      610610Updated Oct 5, 2024Oct 5, 2024
    • VFIMamba

      Public
      [NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models
      Python
      911670Updated Sep 26, 2024Sep 26, 2024
    • [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
      Python
      02700Updated Sep 11, 2024Sep 11, 2024
    • PRVG

      Public
      [CVIU 2024] End-to-end dense video grounding via parallel regression
      Python
      0600Updated Sep 11, 2024Sep 11, 2024
    • BIVDiff

      Public
      [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
      Python
      27500Updated Sep 11, 2024Sep 11, 2024
    • CoMAE

      Public
      [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
      Python
      43731Updated Aug 20, 2024Aug 20, 2024
    • SparseOcc

      Public
      [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
      Python
      28354221Updated Aug 15, 2024Aug 15, 2024
    • ProVP

      Public
      [IJCV] Progressive Visual Prompt Learning with Contrastive Feature Re-formation
      Python
      01400Updated Aug 10, 2024Aug 10, 2024
    • CamLiFlow

      Public
      [CVPR 2022 Oral & TPAMI 2023] Learning Optical Flow and Scene Flow with Bidirectional Camera-LiDAR Fusion
      Python
      2223910Updated Jul 29, 2024Jul 29, 2024
    • ZeroI2V

      Public
      [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
      Python
      12200Updated Jul 29, 2024Jul 29, 2024