    Repositories list

    • Structured Video Comprehension of Real-World Shorts
      Python
      7197130 Updated Sep 20, 2025
    • IC-Custom

      Public
      [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning
      Python
      313530 Updated Sep 15, 2025
    • vllm

      Public
      vllm for ARC-Hunyuan-Video-7B
      Python
      0005 Updated Sep 8, 2025
    • Official implementation of the paper "GenCompositor: Generative Video Compositing with Diffusion Transformer"
      Python
      211420 Updated Sep 4, 2025
    • BrushEdit

      Public
      [under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
      Python
      28581110 Updated Sep 3, 2025
    • AudioStory: Generating Long-Form Narrative Audio with Large Language Models
      Jupyter Notebook
      1627231 Updated Sep 2, 2025
    • Streamlining Cartoon Production with Generative Post-Keyframing
      Python
      3242870 Updated Aug 20, 2025
    • TokLIP

      Public
      TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
      Python
      521670 Updated Aug 18, 2025
    • [ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
      JavaScript
      1419992 Updated Aug 4, 2025
    • HTML
      0000 Updated Aug 1, 2025
    • [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
      Python
      1538730 Updated Jul 30, 2025
    • Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
      Python
      07410 Updated Jul 13, 2025
    • SEED-Voken: A Series of Powerful Visual Tokenizers
      Python
      3694021 Updated Jun 27, 2025
    • Python
      28810 Updated Jun 23, 2025
    • GRPO-CARE

      Public
      Python
      17540 Updated Jun 23, 2025
    • MindOmni

      Public
      Python
      012710 Updated Jun 18, 2025
    • Moto

      Public
      [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
      Python
      313340 Updated May 11, 2025
    • ColorFlow

      Public
      The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization".
      Python
      36432130 Updated Apr 16, 2025
    • [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
      Python
      2833151 Updated Apr 9, 2025
    • [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
      Python
      2848590 Updated Apr 8, 2025
    • DiTCtrl

      Public
      [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
      Python
      730070 Updated Mar 30, 2025
    • DI-PCG

      Public
      Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
      Python
      312430 Updated Mar 23, 2025
    • BlobCtrl

      Public
      [SIGGRAPH ASIA'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
      Python
      11810 Updated Mar 20, 2025
    • Divot

      Public
      Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
      Python
      28230 Updated Feb 27, 2025
    • Official Code for MotionCtrl [SIGGRAPH 2024]
      Python
      761.5k280 Updated Feb 19, 2025
    • ViT-Lens

      Public
      [CVPR 2024] ViT-Lens: Towards Omni-modal Representations
      Python
      1218140 Updated Feb 3, 2025
    • A framework to convert any 2D videos to immersive stereoscopic 3D
      Python
      34383241 Updated Jan 7, 2025
    • InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
      Python
      4514k1134 Updated Jan 3, 2025
    • BrushNet

      Public
      [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
      Python
      1371.7k540 Updated Dec 17, 2024
    • [CVPR 2025] Boosting Generative Novel View Synthesis with Sparse and Unposed Images
      Python
      612130 Updated Dec 9, 2024