Interactive-LLM-VTuber

Project Overview

Interactive-LLM-VTuber is an innovative platform for interactive virtual streamers, leveraging advanced AI technologies to deliver an immersive user experience. The project supports voice input, text generation, and voice output, with high scalability. Currently in development are features like long-term memory, image recognition, and sentiment analysis. Future plans include local deployment, deep reinforcement learning, system integration, framework optimization, and embedded device support to build an intelligent VTuber ecosystem.

Feature Highlights

Real-time Voice Interaction: Enables natural language input via automatic speech recognition (ASR).
Intelligent Conversation: Supports multiple large language models (LLMs), including Tongyi Qianwen, Deepseek (online), and Ollama2.5:7b (local offline).
Speech Synthesis: Utilizes Edge-TTS for smooth text-to-speech output.
Dynamic Front-end: Built with Flask, HTML, JavaScript, and CSS for an intuitive user interface.
Modular Design: Facilitates feature expansion and third-party integration.

Demo

Showcasing real-time interaction with the VTuber model.

Technology Stack

Programming Language: Python
Speech Recognition (ASR): speech_recognition (online)
Large Language Models (LLMs):
- Tongyi Qianwen (online)
- Deepseek (online)
- Ollama2.5:7b (local offline)
Text-to-Speech (TTS): edge-tts (online)
Front-end and Back-end Interaction: Flask + HTML + JavaScript + CSS

Note: Some models may require specific configurations for compatibility.

Supported Platforms

Windows: Fully tested and stable.
Linux: Theoretically compatible (testing recommended).

Installation and Use

Prerequisites

Install VSCode or PyCharm.
Install Python 3.11 interpreter.
(Optional) Use a virtual environment to isolate dependencies.

Steps

Clone the project and enter the directory:

git clone https://github.com/toke648/AI-Interactive-LLM-VTuber.git
cd AI-Interactive-LLM-VTuber

Create and activate a virtual environment:

Windows:

python -m venv vtuber
vtuber\Scripts\activate

Conda Environment:

conda create -n vtuber python=3.11
conda activate vtuber

Linux/macOS:

python -m venv vtuber
source vtuber/bin/activate

Install dependencies:
```
pip install -r requirements.txt
```
Configure API:
- Edit mainsetting.py to configure API keys (e.g., for Tongyi Qianwen or Ollama) and other settings.
Start the project:
```
python server.py
```
Or use the one-click startup script (Windows):
```
setup.bat
```

Other Configurations

Port Modification: Adjust the port or other settings in mainsetting.py.
Model Switching: Modify the cubism4Model variable in static/js/appserver.js to switch VTuber models (not yet integrated into the UI).
System Settings: Access the configuration page via the “Settings” button in the UI. Restart the project to apply changes.

Update Log (Version 0.4.0)

One-click Startup: Added setup.bat script to simplify the startup process for Windows users.
Model Switching: Supports manual VTuber model switching by modifying the path in static/js/appserver.js.
System Configuration Page: Added a settings interface, accessible via the “Settings” button. Restart the project to apply changes.

Notes

Ensure API keys and environment variables are correctly configured for LLM and TTS functionality.
Linux users may need to verify compatibility. Feedback is welcome via GitHub Issues.
The project is actively updated. Follow the GitHub repository for the latest updates.

License

This project is licensed under the MIT License. Contributions and suggestions are warmly welcomed!

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.idea		.idea
__pycache__		__pycache__
audio		audio
llm		llm
music		music
sr		sr
static		static
templates		templates
tts		tts
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
Screenshot 2025-01-01 174024-demo.png		Screenshot 2025-01-01 174024-demo.png
main_setting.json		main_setting.json
main_setting.py		main_setting.py
mcp_test.py		mcp_test.py
mcp_tool.py		mcp_tool.py
requirements.txt		requirements.txt
server.py		server.py
setup.bat		setup.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Interactive-LLM-VTuber

Project Overview

Feature Highlights

Demo

Technology Stack

Supported Platforms

Installation and Use

Prerequisites

Steps

Other Configurations

Update Log (Version 0.4.0)

Notes

License

About

Uh oh!

Releases 4

Packages

Uh oh!

Languages

License

toke648/AI-Interactive-LLM-VTuber

Folders and files

Latest commit

History

Repository files navigation

Interactive-LLM-VTuber

Project Overview

Feature Highlights

Demo

Technology Stack

Supported Platforms

Installation and Use

Prerequisites

Steps

Other Configurations

Update Log (Version 0.4.0)

Notes

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Languages

Packages