Skip to content

Improve AudioEncoder #208

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
zhzLuke96 opened this issue Mar 4, 2025 · 1 comment
Open

Improve AudioEncoder #208

zhzLuke96 opened this issue Mar 4, 2025 · 1 comment
Labels
enhancement New feature or request performance Performance issues Story Next iteration summary and TODO list

Comments

@zhzLuke96
Copy link
Member

增强 audio encoder 性能

现在音频编码使用 ffmpeg ,延迟有点高,并且不同运行环境可能还有不同的问题
应该尝试增强音频编码性能

@zhzLuke96 zhzLuke96 added enhancement New feature or request performance Performance issues labels Mar 4, 2025
@zhzLuke96
Copy link
Member Author

记录下其他库的观察现象:

  • 在 webui 中,gradio可以输出mp3格式,但是其实我们传给 gradio 的只是简单的 pcm wav 流,按道理说它应该也要转码的,但是 gradio 中貌似没有转码这个过程? 值得学习一波 👍
  • 在使用 azure tts 的时候转码延迟也非常低,不太像是使用 gpu 加速之类的做法,也许有什么办法可以硬转码将 pcm 转为其他格式?

@zhzLuke96 zhzLuke96 added the Story Next iteration summary and TODO list label Mar 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request performance Performance issues Story Next iteration summary and TODO list
Projects
None yet
Development

No branches or pull requests

1 participant