Skip to content

KwaiVGI/StyleMaster

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

StyleMaster

CVPR'25  project page 

StyleMaster: Stylize Your Video with Artistic Generation and Translation

image

Zixuan Ye1 †, Huijuan Huang2✉, Xintao Wang2, Pengfei Wan2, Di Zhang2, Wenhan Luo1✉

1 Hong Kong University of Science and Technology
2 Kuaishou Technology
† Intern at KwaiVGI, Kuaishou Technology
✉ Corresponding Author

TODO

  • Code and Weight for T2V Implementation on Wan-1.4B based on Diffsynth-Studio are avaiable.
  • Illusion dataset generation

Update

  • [2025.2] StyleMaster has been accepted by CVPR2025!
  • [2024.10] arXiv preprint is available.

Introduction

Welcome to StyleMaster! StyleMaster focuses on style control, i.e., generating or translating a video to match the style of a given reference image. StyleMaster preserves local textures and enhance global style representations. Additionally, a motion adapter and gray tile ControlNet are employed to enhance motion quality and provide precise content guidance.

Features

  • Local Patch Selection: Overcomes content leakage in style transfer by selecting patches with less similarity to text prompts.
  • Global Style Extraction: Uses a projection module after CLIP supervised by illusion datasets.
  • Motion Adapter: Enhances motion quality during inference and helps to enhance the style extent.
  • Gray Tile ControlNet: Provides accessible yet precise content guidance for video style transfer.
  • High-Quality Video Generation: Generates videos with high style similarity to the reference image and achieves ideal translation results.

Illusion Dataset Generation

Please refer to visual_anagrams/readme.md for details.

Style Extraction

Please refer to style_extraction for details.

cd style_extraction
python style_extraction_module.py

Evaluation results

We show the complete results generated by our method and other baselines in Google Drive

Training and Inference on StyleMaster-Wan

please refer to stylemaster-wan/readme.md for details.

Citation

@inproceedings{ye2025stylemaster,
  title={Stylemaster: Stylize your video with artistic generation and translation},
  author={Ye, Zixuan and Huang, Huijuan and Wang, Xintao and Wan, Pengfei and Zhang, Di and Luo, Wenhan},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={2630--2640},
  year={2025}
}

About

[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published