请问要怎么输出论文里提到的48kHz音频？ #34

anttxs · 2025-04-11T04:01:16Z

论文里提到：

first converting semantic tokens into the Mel spectrogram via a Mel decoder, and then generating the audio with a high sampling rate of 48 kHz via a super-resolution neural vocoder.

但是README里提供的例子却是按照24000的采样率来保存音频的。

请问要怎么输出48kHz的音频呢？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

请问要怎么输出论文里提到的48kHz音频？ #34

请问要怎么输出论文里提到的48kHz音频？ #34

anttxs commented Apr 11, 2025

请问要怎么输出论文里提到的48kHz音频？ #34

请问要怎么输出论文里提到的48kHz音频？ #34

Comments

anttxs commented Apr 11, 2025