-
🔭 I’m currently working on Speech LLM, Multimodal Fusion and Emotion Recognition models
-
📫 You can reach me at qw2443@columbia.edu or joeyventicup@gmail.com.
-
💻 I have a deep interest in Speech Processing and Large Language Models.
-
📄 I hold a Bachelor of Computer Science and Engineering and am pursuing a Master's in Electrical Engineering starting Fall 2024.
-
🍰 My bilibili acount: Venti_J的个人空间
-
⚡ I am also a hip-hop music artist and producer! Check out my Netease Cloud Music: Venti_J的歌手页
Highlights
- Pro
Popular repositories Loading
-
-
-
ChatVITS_Cyberpunk2077
ChatVITS_Cyberpunk2077 PublicChatGPT+VITS using Cyberpunk2077 dataset
Python 2
-
Text-to-sound-Synthesis
Text-to-sound-Synthesis PublicForked from yangdongchao/Text-to-sound-Synthesis
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.