Skip to content
Change the repository type filter

All

    Repositories list

    • Improving Video Generation with Human Feedback
      Python
      327271Updated Aug 14, 2025Aug 14, 2025
    • [ICCV'25 Oral] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
      Python
      641.4k491Updated Jul 24, 2025Jul 24, 2025
    • [CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation
      Jupyter Notebook
      413270Updated Jul 17, 2025Jul 17, 2025
    • MODA

      Public
      [ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
      Python
      15320Updated Jul 10, 2025Jul 10, 2025
    • VIVID

      Public
      HTML
      0100Updated Jul 10, 2025Jul 10, 2025
    • [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
      Python
      07630Updated Jul 4, 2025Jul 4, 2025
    • [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
      Jupyter Notebook
      1635400Updated Jul 4, 2025Jul 4, 2025
    • VMoBA

      Public
      Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
      Python
      33910Updated Jul 1, 2025Jul 1, 2025
    • Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"
      16510Updated Jun 23, 2025Jun 23, 2025
    • Bring portraits to life!
      Python
      1.7k17k26710Updated Jun 14, 2025Jun 14, 2025
    • [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
      Python
      17614150Updated May 23, 2025May 23, 2025
    • Scripts for processing and evaluating SocioEmoDialog datasets. It includes the core processing scripts, evaluation metrics, and additional documentation.
      Python
      0300Updated May 16, 2025May 16, 2025
    • Python
      10139113Updated May 6, 2025May 6, 2025
    • DiffMoE

      Public
      PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
      Python
      412210Updated Apr 19, 2025Apr 19, 2025
    • Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"
      Python
      26320Updated Apr 15, 2025Apr 15, 2025
    • [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
      Python
      1238950Updated Mar 22, 2025Mar 22, 2025
    • Koala-36M

      Public
      Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
      Python
      519770Updated Mar 19, 2025Mar 19, 2025
    • Uniaa

      Public
      Unified Multi-modal IAA Baseline and Benchmark
      Python
      68440Updated Sep 27, 2024Sep 27, 2024
    • I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
      Python
      1222360Updated Jun 18, 2024Jun 18, 2024
    • DVIS_Plus

      Public
      Decoupled Video Instance Segmentation Framework, improved version of dvis
      Python
      2900Updated May 22, 2024May 22, 2024
    • DVIS

      Public
      Decoupled Video Instance Segmentation Framework
      Python
      1600Updated May 22, 2024May 22, 2024