xiesx123/CreatorBox

CreatorBox 💸

English | 中文

🚀🎬 Flexible, efficient, and scalable toolbox for editing and dubbing, unleashing creative potential

Web Interface

Dubbing/Editing

🔧 Dubbing | ✂️ Editing
Click to watch video | Click to watch video

Video Demonstration

▶️ Original | ▶️ Dubbed
dahuaxiyou_zh.mp4 | dahuaxiyou_en.mp4
qiuzhimianshi_en.mp4 | qiuzhimianshi_zh.mp4
wukong_zh.mp4 | wukong_en.mp4
shangpin.jieshao_en.mp4 | shangpin.jieshao_zh.mp4

📦 Quick Start

👉 Refer to the usage guide: Local Installation | Remote Deployment

🎨 Applicable Scenarios

  • 🎥 Content Creators: Streamline video dubbing, translation, and editing workflows to work more efficiently

  • 🌍 Multilingual Translation/Dubbing: Create localized content for overseas audiences and publish across languages

  • ⚙️ Independent Deployment: Deploy locally with flexible configuration to keep your data private

🎯 Features

  • 🎤 Subtitle Recognition

    Transcribe speech from video and audio, with configurable settings to suit different devices and scenarios

  • 🌐 Language Translation

    Translate between multiple languages, switch translation providers, and tune advanced parameters to improve translation quality

  • 🎧 Speech Synthesis

    A rich voice library and customization options for personalized dubbing, with real-time preview

  • ✂️ Draft Editing

    Export materials to editing tools with control over visuals, audio, and subtitles for post-production adjustments

  • 🧩 Application Components

    Built-in application components that work together and can be used flexibly to suit different needs

  • 🔧 Preview and Debugging

    Adjust configuration and preview the result at each stage to improve efficiency and output quality

📅 Planned Support

Subtitles

  • Multiple Providers: Support switching between Original Subtitles, CapCut Draft, FunASR, and FasterWhisper
  • Automatic video download from YouTube and TikTok
  • Convert video audio to text and extract subtitles
  • Multi-track separation for vocals, accompaniment, drums, bass, etc.
  • Speaker embedding extraction and alignment with subtitle text
  • Multi-speaker recognition
  • Emotion recognition: supports Angry, Disgusted, Fearful, Happy, Neutral, Other, Sad, and Surprised
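
As an illustration of the "convert audio to text and extract subtitles" step, the sketch below renders timed text segments as SRT cues. It is not CreatorBox's own code: the `Segment` class and both function names are hypothetical, and the segments are assumed to come from whichever recognition provider is selected.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One recognized utterance (hypothetical structure for this sketch)."""
    start: float  # seconds from the beginning of the media
    end: float    # seconds from the beginning of the media
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        time_line = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        cues.append(f"{index}\n{time_line}\n{seg.text.strip()}\n")
    return "\n".join(cues)
```

Calling `to_srt([Segment(0.0, 1.5, "Hello")])` yields a single cue numbered 1 that spans `00:00:00,000 --> 00:00:01,500`.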

Translation

  • Multiple Providers: Support switching between OpenAI, Gemini, and DashScope
  • Custom Models and Prompt Commands
  • Batch Processing for Long Texts
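
One common way to handle batch processing for long texts is to group subtitle lines into batches that each fit within a size budget before sending them to a translation provider. The sketch below shows that idea only; `batch_lines` and its `max_chars` parameter are hypothetical names, and the character limit is an assumed stand-in for whatever limit a given provider imposes.

```python
def batch_lines(lines: list[str], max_chars: int = 2000) -> list[list[str]]:
    """Group lines into batches whose newline-joined length stays within max_chars.

    A single line longer than max_chars is kept whole in a batch of its own.
    """
    batches: list[list[str]] = []
    current: list[str] = []
    current_len = 0
    for line in lines:
        # The extra 1 accounts for the newline that joins lines inside a batch.
        added = len(line) + (1 if current else 0)
        if current and current_len + added > max_chars:
            batches.append(current)
            current, current_len = [], 0
            added = len(line)
        current.append(line)
        current_len += added
    if current:
        batches.append(current)
    return batches
```

Keeping whole lines together means each translated batch can be split back on newlines and matched to the original subtitle cues by position.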

Speech

  • Multiple Providers: Support switching between EdgeTTS, ElevenLabs, CosyVoice2, F5TTS, and CoquiTTS
  • Real-Time Speech Synthesis and Preview
  • Voice Library: Includes Built-in, Video, and Custom voice types
  • Voice Cloning: Supports Voice Cloning, Voice Commands, Voice Conversion, and Cross-Language Cloning
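
Switching between providers can be modeled as a small registry that maps a provider name to a synthesis function. This is a generic sketch, not CreatorBox's implementation: `register`, `synthesize`, and the `dummy` backend are hypothetical, and the backend returns placeholder bytes instead of calling a real TTS engine.

```python
from typing import Callable, Dict

# A synthesizer takes text and a voice name and returns audio bytes.
Synthesizer = Callable[[str, str], bytes]

_PROVIDERS: Dict[str, Synthesizer] = {}


def register(name: str) -> Callable[[Synthesizer], Synthesizer]:
    """Decorator that records a synthesizer under a case-insensitive name."""
    def wrap(func: Synthesizer) -> Synthesizer:
        _PROVIDERS[name.lower()] = func
        return func
    return wrap


def synthesize(provider: str, text: str, voice: str) -> bytes:
    """Dispatch to the selected provider, failing clearly on unknown names."""
    try:
        backend = _PROVIDERS[provider.lower()]
    except KeyError:
        known = ", ".join(sorted(_PROVIDERS)) or "none"
        raise ValueError(f"unknown provider {provider!r}; registered: {known}") from None
    return backend(text, voice)


@register("dummy")
def _dummy_backend(text: str, voice: str) -> bytes:
    # Placeholder backend: a real one would call a TTS engine here.
    return f"{voice}:{text}".encode("utf-8")
```

With this shape, adding a provider means registering one more function, and the calling code only changes the provider name it passes in.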

Draft

  • Track Control: Supports up to 6 tracks for Visuals, Audio, and Subtitles
  • Subtitle Generation: Customize Size, Position, Color, and Outline settings
  • Volume Adjustment: Control Original Sound, Speech, and Background Music volumes

Applications

  • Material Extraction: Extract draft materials such as videos, audio, and images
  • Ultimate Vocal Separation: Quickly extract vocals, accompaniment, drums, bass, and other multi-track audio
  • Visual Element Removal: Remove subtitles, watermarks, corner marks, and other visual elements
  • Scene Detection: Automatically detect scene transitions and export segmented clips
  • Subtitle Extraction: Use OCR to recognize embedded subtitles and generate editable text
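
Scene detection is often done by scoring how much each frame differs from the previous one and cutting where the score jumps. The sketch below shows that thresholding step on precomputed scores. It makes no claim about the detector CreatorBox uses: `find_cuts`, `cuts_to_scenes`, and the default `threshold` and `min_scene_len` values are assumptions chosen for illustration.

```python
def find_cuts(
    frame_diffs: list[float],
    threshold: float = 30.0,
    min_scene_len: int = 15,
) -> list[int]:
    """Return the frame indices at which a new scene starts.

    frame_diffs[i] is the difference score between frame i and frame i + 1,
    so a score above threshold places a cut at frame i + 1. Cuts closer than
    min_scene_len frames to the previous cut are ignored.
    """
    cuts: list[int] = []
    last_cut = 0
    for i, score in enumerate(frame_diffs):
        frame = i + 1
        if score > threshold and frame - last_cut >= min_scene_len:
            cuts.append(frame)
            last_cut = frame
    return cuts


def cuts_to_scenes(cuts: list[int], total_frames: int) -> list[tuple[int, int]]:
    """Turn cut positions into half-open (start, end) frame ranges."""
    bounds = [0, *cuts, total_frames]
    return list(zip(bounds[:-1], bounds[1:]))
```

Each `(start, end)` range can then be converted to timestamps using the video's frame rate and exported as a separate clip.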

Others

  • Dubbing Modes: Choose between Video, Audio, and Adaptive modes
  • Translation Modes: Translate videos from the original language to another
  • Narration Mode: Planned...
  • Automated Posting

Feedback and Suggestions 📢

Star History