A Roundup of Classic TTS Models and Papers

Classic TTS papers and models:

WaveNet: "WaveNet: A Generative Model for Raw Audio"

Tacotron: "Tacotron: Towards End-to-End Speech Synthesis"

Tacotron2: "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions"

HiFi-GAN: "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

VITS: "Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech"

SV2TTS: "Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis"

GitHub links

  • https://github.com/JOETtheIV/vits-mandarin-biaobei
  • [SoftVITS] soVITS3.0 by @Rcell: https://github.com/innnky/so-vits-svc. Sample result: "AI电棍:浮夸"
  • [MockingBird] https://github.com/babysor/MockingBird
  • [DiffSVC] https://github.com/prophesier/diff-svc; demo: 『Diff-svc』東の空から始まる世界-姬野星奏Ver. (https://www.bilibili.com/video/BV1LP4y127yL)
  • Emotion feature extraction: https://github.com/audeering/w2v2-how-to
  • VITS speech synthesis with emotion control: https://github.com/innnky/emotional-vits
  • Online demo: https://huggingface.co/spaces/innnky/nene-emotion
  • E-book reader: https://github.com/gedoor/legado
  • Fast VITS speech synthesis: https://github.com/MasayaKawamura/MB-iSTFT-VITS
  • WaveNet, Tacotron, Tacotron2, HiFi-GAN, FastSpeech, Glow-TTS

TTS models and project links

  • VITS: https://gitcode.net/mirrors/dtx525942103/vits_chinese
  • [MoeTTS, Tacotron2+HifiGAN] -落忆-: https://space.bilibili.com/228292951; video "[MoeTTS] Near-perfect ATRI speech synthesis based on Tacotron2+HifiGAN": https://www.bilibili.com/video/BV1Tr4y1577U. MoeTTS is a release repository for Tacotron2/HifiGAN models plus a precompiled GUI. Project: https://github.com/luoyily/MoeTTS
  • Rcell: https://space.bilibili.com/343303724
  • [VITS] (Venti_J) "[Genshin Impact] Paimon's VTuber debut plan: Paimon speech synthesis and avatar puppeteering with deep-learning-based VITS and VSeeFace": https://www.bilibili.com/video/BV16G4y1B7Ey

Classic NLP papers:

"Attention Is All You Need" (2017-06-12): https://arxiv.org/abs/1706.03762
"Sequence to Sequence Learning with Neural Networks" (2014-09-10): https://arxiv.org/abs/1409.3215

Recommended blog posts:

"Understanding LSTM Networks" by Colah

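Survey post: "BLSTM-RNN, Deep Voice, Tacotron… have you mastered them all? A one-article summary of the essential classic speech synthesis models (Part 1)": https://new.qq.com/rain/a/20221204A02GIT00

A 2019 guide to speech synthesis with deep learning: https://www.bilibili.com/read/cv3532474/

Speech synthesis method, model training method, device, storage medium, and process: https://www.xjishu.com/zhuanli/21/202111674186.html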

posted @ 2022-11-12 15:53  倦鸟已归时