Skip to content

请问要怎么输出论文里提到的48kHz音频? #34

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
anttxs opened this issue Apr 11, 2025 · 0 comments
Open

请问要怎么输出论文里提到的48kHz音频? #34

anttxs opened this issue Apr 11, 2025 · 0 comments

Comments

@anttxs
Copy link

anttxs commented Apr 11, 2025

论文里提到:

first converting semantic tokens into the Mel spectrogram via a Mel decoder, and then generating the audio with a high sampling rate of 48 kHz via a super-resolution neural vocoder.

但是README里提供的例子却是按照24000的采样率来保存音频的。

请问要怎么输出48kHz的音频呢?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant