ARC Lab, Tencent PCG

74 repositories

ARC-Hunyuan-Video-7B
Public
Structured Video Comprehension of Real-World Shorts
Python
•
Other
•7•197•13•0•Updated Sep 20, 2025
IC-Custom
Public
[Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning
flux application image image-editing image-inpainting image-customization aigc
Python
•
Other
•3•135•3•0•Updated Sep 15, 2025
vllm
Public
vLLM for ARC-Hunyuan-Video-7B
Python
•
Apache License 2.0
•0•0•0•5•Updated Sep 8, 2025
GenCompositor
Public
Official implementation of the paper "GenCompositor: Generative Video Compositing with Diffusion Transformer"
video-editing diffusion-models diffusion-transformer
Python
•
Other
•2•114•2•0•Updated Sep 4, 2025
BrushEdit
Public
[under review] The official implementation of the paper "BrushEdit: All-In-One Image Inpainting and Editing"
image-editing image-inpainting diffusion-models
Python
•
Other
•28•581•11•0•Updated Sep 3, 2025
AudioStory
Public
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
video-to-audio diffusion-models text-to-audio audio-generation multimodal-large-language-models video-dubbing
Jupyter Notebook
•16•272•3•1•Updated Sep 2, 2025
ToonComposer
Public
Streamlining Cartoon Production with Generative Post-Keyframing
Python
•
Other
•32•428•7•0•Updated Aug 20, 2025
TokLIP
Public
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
Python
•
Other
•5•216•7•0•Updated Aug 18, 2025
FreeSplatter
Public
[ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
JavaScript
•
Other
•14•199•9•2•Updated Aug 4, 2025
TencentARC.github.io
Public
HTML
•0•0•0•0•Updated Aug 1, 2025
GeometryCrafter
Public
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
depth-estimation video-to-4d iccv2025
Python
•
Other
•15•387•3•0•Updated Jul 30, 2025
Video-Holmes
Public
Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
Python
•
Apache License 2.0
•0•74•1•0•Updated Jul 13, 2025
SEED-Voken
Public
SEED-Voken: A Series of Powerful Visual Tokenizers
Python
•
Apache License 2.0
•36•940•2•1•Updated Jun 27, 2025
SEED-Bench-R1
Public
Python
•
Apache License 2.0
•2•88•1•0•Updated Jun 23, 2025
GRPO-CARE
Public
Python
•
Apache License 2.0
•1•75•4•0•Updated Jun 23, 2025
MindOmni
Public
Python
•
Other
•0•127•1•0•Updated Jun 18, 2025
Moto
Public
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
Python
•
Other
•3•133•4•0•Updated May 11, 2025
ColorFlow
Public
The official implementation of the paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization".
computer-vision image-colorization colorization automatic-colorization
Python
•
Other
•36•432•13•0•Updated Apr 16, 2025
AnimeGamer
Public
[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
Python
•
Other
•28•331•5•1•Updated Apr 9, 2025
VideoPainter
Public
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
video video-editing video-inpainting video-dataset
Python
•
Other
•28•485•9•0•Updated Apr 8, 2025
DiTCtrl
Public
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
Python
•
Other
•7•300•7•0•Updated Mar 30, 2025
DI-PCG
Public
Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
Python
•
Other
•3•124•3•0•Updated Mar 23, 2025
BlobCtrl
Public
[SIGGRAPH ASIA'25] BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing
image-editing aigc
Python
•
Other
•1•18•1•0•Updated Mar 20, 2025
Divot
Public
Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)
Python
•
Other
•2•82•3•0•Updated Feb 27, 2025
MotionCtrl
Public
Official Code for MotionCtrl [SIGGRAPH 2024]
Python
•
Apache License 2.0
•76•1.5k•28•0•Updated Feb 19, 2025
ViT-Lens
Public
[CVPR 2024] ViT-Lens: Towards Omni-modal Representations
multimodal-learning
Python
•
Other
•12•181•4•0•Updated Feb 3, 2025
StereoCrafter
Public
A framework to convert any 2D videos to immersive stereoscopic 3D
Python
•
Other
•34•383•24•1•Updated Jan 7, 2025
InstantMesh
Public
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
Python
•
Apache License 2.0
•451•4k•113•4•Updated Jan 3, 2025
BrushNet
Public
[ECCV 2024] The official implementation of the paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
text-to-image image-inpainting diffusion eccv diffusion-models eccv2024
Python
•
Other
•137•1.7k•54•0•Updated Dec 17, 2024
NVComposer
Public
[CVPR 2025] Boosting Generative Novel View Synthesis with Sparse and Unposed Images
Python
•
Other
•6•121•3•0•Updated Dec 9, 2024