StyleMaster

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Zixuan Ye¹†, Huijuan Huang²✉, Xintao Wang², Pengfei Wan², Di Zhang², Wenhan Luo¹✉

¹ Hong Kong University of Science and Technology
² Kuaishou Technology
† Intern at KwaiVGI, Kuaishou Technology
✉ Corresponding Author

TODO

Code and weights for the T2V implementation on Wan-1.4B, based on DiffSynth-Studio, are available.
Illusion dataset generation

Update

[2025.2] StyleMaster has been accepted by CVPR 2025!
[2024.10] arXiv preprint is available.

Introduction

Welcome to StyleMaster! StyleMaster focuses on style control, i.e., generating or translating a video to match the style of a given reference image. StyleMaster preserves local textures and enhances global style representations. Additionally, a motion adapter and a gray tile ControlNet are employed to improve motion quality and provide precise content guidance.

Features

Local Patch Selection: Overcomes content leakage in style transfer by selecting patches with less similarity to text prompts.
Global Style Extraction: Uses a projection module after CLIP supervised by illusion datasets.
Motion Adapter: Enhances motion quality during inference and helps strengthen the degree of stylization.
Gray Tile ControlNet: Provides accessible yet precise content guidance for video style transfer.
High-Quality Video Generation: Generates videos with high style similarity to the reference image and achieves ideal translation results.
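
The Local Patch Selection idea above can be illustrated with a minimal sketch. It assumes that patch embeddings and a text-prompt embedding already live in a shared space (for example, from a CLIP-style encoder) and keeps the patches with the lowest cosine similarity to the prompt. The function name `select_style_patches`, the `keep_ratio` parameter, and the toy vectors are hypothetical and chosen for illustration only; the repository's actual selection rule and ratio may differ.

```python
import numpy as np


def select_style_patches(patch_emb, text_emb, keep_ratio=0.3):
    """Return indices of the patches least similar to the text prompt.

    Hypothetical helper: patch_emb has shape (num_patches, dim),
    text_emb has shape (dim,). Both are assumed to share one embedding space.
    """
    # Normalize so that the dot product equals cosine similarity.
    patch_emb = patch_emb / np.linalg.norm(patch_emb, axis=1, keepdims=True)
    text_emb = text_emb / np.linalg.norm(text_emb)
    sim = patch_emb @ text_emb  # shape: (num_patches,)

    # Keep at least one patch.
    num_keep = max(1, int(round(keep_ratio * len(sim))))

    # argsort is ascending, so the first entries are the least similar patches,
    # i.e. the ones least likely to carry prompt-related content.
    return np.argsort(sim)[:num_keep]


# Toy example with hand-written 4-dimensional embeddings (not real CLIP features).
text = np.array([1.0, 0.0, 0.0, 0.0])
patches = np.array([
    [1.0, 0.0, 0.0, 0.0],   # same direction as the prompt
    [0.0, 1.0, 0.0, 0.0],   # orthogonal to the prompt
    [-1.0, 0.0, 0.0, 0.0],  # opposite to the prompt
    [0.5, 0.5, 0.0, 0.0],   # partially aligned with the prompt
])
kept = select_style_patches(patches, text, keep_ratio=0.5)
print(kept)
```

In this toy setup the patch aligned with the prompt is discarded, while the opposite and orthogonal patches are kept as style carriers. Real embeddings would come from the image and text encoders used by the model.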

Illusion Dataset Generation

Please refer to visual_anagrams/readme.md for details.

Style Extraction

Please refer to style_extraction for details.

cd style_extraction
python style_extraction_module.py

Evaluation results

The complete results generated by our method and the other baselines are available on Google Drive.

Training and Inference on StyleMaster-Wan

Please refer to stylemaster-wan/readme.md for details.

Citation

@inproceedings{ye2025stylemaster,
  title={Stylemaster: Stylize your video with artistic generation and translation},
  author={Ye, Zixuan and Huang, Huijuan and Wang, Xintao and Wan, Pengfei and Zhang, Di and Luo, Wenhan},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={2630--2640},
  year={2025}
}
