I am a master student at College of Computer Science and Technology, Zhejiang University, majoring in Computer Science.
Currently I work on the Audio Research Team at Zhejiang University, under the supervision of Prof. Zhou Zhao. Previously I graduated from Turing Class, a program established by Chu Kochen Honors College, with a bachelor's degree in Artificial Intelligence.
My research interests primarily focus on Multi-Modal Generative AI, specifically in Spatial Audio, Music, Singing, and Speech. I have published papers at top international AI conference, including NeurIPS, ACL, AAAI and ACM-MM. Currently, I am working on Spatial Audio Generation and Immersive Audio Synthesis.
I am actively looking for academic collaboration, feel free to contact me via email at panch@zju.edu.cn.
- Personal Pages: https://david-pigeon.github.io (updated recentlyπ₯)
- Linkedin: https://www.linkedin.com/in/changhao-pan-4032b8317
- Google Scholar: https://scholar.google.com/citations?user=lAH4cq8AAAAJ
- DBLP: https://dblp.org/pid/382/3463.html
- denotes co-first authors
-
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks Yu Zhang*, Changhao Pan*, Wenxiang Guo*, et al. NeurIPS, 2024(Spotlight)
-
TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis, Yu Zhang*, Wenxiang Guo, Changhao Pan*, et al. ACL, 2025(Findings)
-
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation, Wenxiang Guo*, Yu Zhang*, Changhao Pan*, et al. ACL, 2025(Findings)
-
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching, Wenxiang Guo, Yu Zhang, Changhao Pan, et al. AAAI 2025
-
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control, Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, et al. EMNLP, 2024
-
ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting, Yu Zhang*, Wenxiang Guo*, Changhao Pan*, et al. ACM-MM 2025
-
A Multimodal Evaluation Framework for Spatial Audio Playback Systems: From Localization to Listener Preference, Changhao Pan#, Wenxiang Guo, Yu Zhang, et al. ACM-MM 2025.
- Versatile Framework for Song Generation with Prompt-based Control, Yu Zhang*, Wenxiang Guo*, Changhao Pan*, et al.
- Interactive Table Synthesis with Natural Language Yanwei Huang, Yunfan Zhou, Ran Chen, Changhao Pan, Xinhuan Shu, Di Weng, Yingcai Wu. IEEE TVCG, 2023
-
Outstanding Graduate of Zhejiang Province (Undergraduate), 2025
-
Chu Kochen Scholarship(as undergraduate), 2024
- Highest scholarship at Zhejiang University
-
National Scholarship for three consecutive years (2022 & 2023 & 2024);
-
Outstanding Undergrudates of CCF (2024);
-
BaoGang Elite Scholarship (2023).
-
Conference Reviewer: NeurIPS 2025, ACL 2025
-
Assist to Review: CVPR 2025, EMNLP 2025, ACM MM 2025