google Cloud Text to Speech 的质量更好

如何解决google Cloud Text to Speech 的质量更好

我在 Integromat 中为我的 Adalo 应用程序使用 Google Cloud Text to Speech 模块。我想要更好的音频质量，所以我已经切换到 WaveNet 并将采样率增加到 48000 赫兹，但它的质量仍然很差。我能做什么？非常感谢每个想法，提前致谢！

最好的问候，本

解决方法

您可能需要根据 https://cloud.google.com/text-to-speech/docs/audio-profiles

处的文档指定 effectsProfileId

const effectsProfileId = ['telephony-class-application'];

const request = {
  input: {text: text},voice: {languageCode: languageCode,ssmlGender: ssmlGender},audioConfig: {audioEncoding: 'MP3',effectsProfileId: effectsProfileId},};

语音质量的高低取决于您所说的播放结果音频的设备类型。

将您的数据转换为 Google 提到的推荐编码。使用与 Google 文档中提到的相同的格式，如 Flac 格式。这将提供适当的准确性。使用立体声录音，单独的香奈儿扬声器。