Kuaishou Visual Generation and Interaction Center

21 repositories

VideoAlign
Public
Improving Video Generation with Human Feedback
Python
•3•272•7•1•Updated Aug 14, 2025
ReCamMaster
Public
[ICCV'25 Oral] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
computer-vision camera-control 4d video-generation aigc
Python
•
MIT License
•64•1.4k•49•1•Updated Jul 24, 2025
StyleMaster
Public
[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation
Jupyter Notebook
•4•132•7•0•Updated Jul 17, 2025
MODA
Public
[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
Python
•
Apache License 2.0
•1•53•2•0•Updated Jul 10, 2025
VIVID
Public
HTML
•0•1•0•0•Updated Jul 10, 2025
RoboMaster
Public
[ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
Python
•0•76•3•0•Updated Jul 4, 2025
3DTrajMaster
Public
[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
Jupyter Notebook
•16•354•0•0•Updated Jul 4, 2025
VMoBA
Public
Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
Python
•3•39•1•0•Updated Jul 1, 2025
SPF-Portrait
Public
Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"
•1•65•1•0•Updated Jun 23, 2025
LivePortrait
Public
Bring portraits to life!
video-editing image-animation video-generation face-animation
Python
•
Other
•1.7k•17k•267•10•Updated Jun 14, 2025
SynCamMaster
Public
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
video-generation
Python
•17•614•15•0•Updated May 23, 2025
SocioEmoDialog
Public
Scripts for processing and evaluating the SocioEmoDialog datasets, including the core processing scripts, evaluation metrics, and additional documentation.
Python
•0•3•0•0•Updated May 16, 2025
ComfyUI-KLingAI-API
Public
Python
•10•139•11•3•Updated May 6, 2025
DiffMoE
Public
PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
Python
•
Apache License 2.0
•4•122•1•0•Updated Apr 19, 2025
HumanAesExpert
Public
Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"
Python
•
MIT License
•2•63•2•0•Updated Apr 15, 2025
GameFactory
Public
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
Python
•12•389•5•0•Updated Mar 22, 2025
Koala-36M
Public
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
video-generation video-datasets
Python
•5•197•7•0•Updated Mar 19, 2025
Uniaa
Public
Unified Multi-modal IAA Baseline and Benchmark
benchmark dataset image-aesthetic-assessment mllm llava
Python
•6•84•4•0•Updated Sep 27, 2024
I2V-Adapter
Public
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
Python
•12•223•6•0•Updated Jun 18, 2024
DVIS_Plus
Public
Decoupled Video Instance Segmentation Framework, an improved version of DVIS
Python
•
MIT License
•2•9•0•0•Updated May 22, 2024
DVIS
Public
Decoupled Video Instance Segmentation Framework
Python
•
MIT License
•1•6•0•0•Updated May 22, 2024