English | 中文
A flexible, efficient, and scalable toolbox for video editing and dubbing

| Dubbing | Editing |
|---|---|
| ![]() | ![]() |

| | |
|---|---|
| dahuaxiyou_zh.mp4 | dahuaxiyou_en.mp4 |
| qiuzhimianshi_en.mp4 | qiuzhimianshi_zh.mp4 |
| wukong_zh.mp4 | wukong_en.mp4 |
| shangpin.jieshao_en.mp4 | shangpin.jieshao_zh.mp4 |

Refer to the usage guide: Local Installation | Remote Deployment

- Content Creators: Streamline video dubbing, translation, and editing workflows.
- Multilingual Translation/Dubbing: Create localized content for overseas audiences and publish across languages.
- Independent Deployment: Deploy locally with flexible configuration to keep data private.

- Subtitle Recognition: Transcribe speech from video and audio, with configuration options to suit different devices and scenarios.
- Language Translation: Translate between multiple languages, switch translation providers, and tune advanced parameters to improve results.
- Speech Synthesis: A library of voices with customization options and real-time preview for personalized dubbing.
- Draft Editing: Export materials to editing tools with control over visuals, audio, and subtitles for post-production adjustments.
- Application Components: Built-in application components for efficient collaboration and flexible use.
- Preview and Debugging: Adjust configuration and preview the result at each stage to improve quality and efficiency.

- Multiple providers: support switching between `Original Subtitles`, `CapCut Draft`, `FunAsr`, and `FasterWhisper`
- Automatic video download from `YouTube` and `TikTok`
- Convert video audio to text and extract subtitles
- Multi-track separation for `vocal`, `accompaniment`, `drums`, `bass`, etc.
- Speaker embedding extraction and alignment with subtitle text
- Multi-speaker recognition
- Emotion recognition: supports `Angry`, `Disgusted`, `Fearful`, `Happy`, `Neutral`, `Other`, `Sad`, and `Surprised`
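
Converting audio to text and extracting subtitles ends with timed text being written out in a subtitle format. The following is a minimal sketch of that last step only, assuming the recognizer returns segments with start time, end time, and text. The `Segment` class and the function names are hypothetical and are not part of this toolbox's API; the output follows the common SRT layout.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed span (hypothetical structure, not the toolbox's own type)."""
    start: float  # seconds from the beginning of the media
    end: float    # seconds from the beginning of the media
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text.strip()}\n"
        )
    return "\n".join(cues)


if __name__ == "__main__":
    demo = [Segment(0.0, 1.25, "Hello"), Segment(1.25, 2.0, "World")]
    print(to_srt(demo))
```

Any of the listed providers could feed such a function once its output is mapped to start, end, and text fields.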

- Multiple providers: support switching between `OpenAi`, `Gemini`, and `DashScope`
- Custom models and prompt commands
- Batch processing for long texts
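
Batch processing for long texts usually means splitting the input into chunks that fit a provider's request limit and translating them one by one. The sketch below shows one way this could work, assuming a character-based limit and sentence-level splitting. The function names, the `max_chars` default, and the caller-supplied `translate` callable are all hypothetical; the toolbox's actual batching strategy is not documented here.

```python
import re
from typing import Callable


def split_into_batches(text: str, max_chars: int = 2000) -> list[str]:
    """Group sentences into batches of at most max_chars characters.

    A single sentence longer than max_chars is kept whole rather than cut.
    Sentences inside a batch are joined with a single space.
    """
    # Split right after CJK sentence punctuation, or at whitespace that
    # follows Latin sentence punctuation (so "3.14" is not split).
    parts = re.split(r"(?<=[。！？])|(?<=[.!?])\s+", text)
    sentences = [p.strip() for p in parts if p.strip()]

    batches: list[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}" if current else sentence
        if current and len(candidate) > max_chars:
            batches.append(current)  # current batch is full, start a new one
            current = sentence
        else:
            current = candidate
    if current:
        batches.append(current)
    return batches


def translate_long_text(
    text: str, translate: Callable[[str], str], max_chars: int = 2000
) -> str:
    """Translate text batch by batch with a caller-supplied translate function."""
    return " ".join(translate(batch) for batch in split_into_batches(text, max_chars))


if __name__ == "__main__":
    print(split_into_batches("One. Two. Three.", max_chars=9))
    # str.upper stands in for a real provider call.
    print(translate_long_text("One. Two. Three.", str.upper, max_chars=9))
```

Splitting on sentence boundaries rather than at a fixed offset keeps each request self-contained, which tends to matter for translation quality.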

- Multiple providers: support switching between `EdgeTTS`, `ElevenLabs`, `CosyVoice2`, `F5TTS`, and `CoquiTTS`
- Real-time speech synthesis and preview
- Voice library: includes `Built-in`, `Video`, and `Custom` voice types
- Voice cloning: supports `Voice Cloning`, `Voice Commands`, `Voice Conversion`, and `Cross-Language Cloning`

- Track control: supports up to `6` tracks for `Visuals`, `Audio`, and `Subtitles`
- Subtitle generation: customize `Size`, `Position`, `Color`, and `Outline` settings
- Volume adjustment: control `Original Sound`, `Speech`, and `Background Music` volumes

- Material Extraction: Extract draft materials such as videos, audio, and images.
- Ultimate Vocal Separation: Quickly extract vocals, accompaniment, drums, bass, and other multi-track audio.
- Visual Element Removal: Remove subtitles, watermarks, corner marks, and other visual elements.
- Scene Detection: Automatically detect scene transitions and export segmented clips.
- Subtitle Extraction: Use OCR to recognize embedded subtitles and generate editable text.

- Dubbing modes: choose between `Video`, `Audio`, and `Adaptive` modes
- Translation modes: translate videos from the original language to another
- Narration mode: Planned...
- Automated posting

- Submit feedback via Issues, Discussions, or Email.
- Join the Discord server to discuss usage or new features.