TTS经典模型与论文汇总
TTS经典论文或模型:
WaveNet:《WaveNet: A Generative Model for Raw Audio》
Tacotron:《TACOTRON: Towards End-to-End Speech Synthesis》
Tacotron2:《Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions》
HiFi-GAN:《HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis》
VITS:《Conditional Variational Autoencoder with Adversarial Learning for End-toEnd Text-toSpeech》
SV2TTS:《Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis》
github链接:
- https://github.com/JOETtheIV/vits-mandarin-biaobei
- 【SoftVITS】@Rcell的soVITS3.0:https://github.com/innnky/so-vits-svc。成果:AI电棍:浮夸
- 【MockingBird】https://github.com/babysor/MockingBird
- 【DiffSVC】https://github.com/prophesier/diff-svc;『Diff-svc』東の空から始まる世界-姬野星奏Ver.(https://www.bilibili.com/video/BV1LP4y127yL)
- 情感特征提取 https://github.com/audeering/w2v2-how-to
- VITS情感控制语音合成 https://github.com/innnky/emotional-vits
- 在线demo:https://huggingface.co/spaces/innnky/nene-emotion
- 电子书阅读器:https://github.com/gedoor/legado
- 高速语音合成vits:https://github.com/MasayaKawamura/MB-iSTFT-VITS
- WaveNet、Tacotron、Tacotron2、Hifi-GAN、、FastSpeech、Glow-TTS
TTS模型和项目地址:
- VITS:https://gitcode.net/mirrors/dtx525942103/vits_chinese
- 【MoeTTS、Tacotron2+HifiGAN】-落忆-:https://space.bilibili.com/228292951;『MoeTTS』基于Tacotron2+HifiGAN 近乎完美的ATRI语音合成:https://www.bilibili.com/video/BV1Tr4y1577U; MoeTTS是一个Tacotron2/HifiGAN模型+编译好的GUI版本发布仓库。 项目地址:https://github.com/luoyily/MoeTTS
- 【Tacotron2】CjangCjengh:https://space.bilibili.com/35285881;基于tacotron2合成宁宁语音(day1-2):https://www.bilibili.com/video/BV1rV4y177Z7
- Rcell:https://space.bilibili.com/343303724
- 【VITS】(Venti_J)【原神】派蒙Vtuber出道计划——基于AI深度学习VITS和VSeeFace的派蒙语音合成/套皮:https://www.bilibili.com/video/BV16G4y1B7Ey
NLP经典论文:
《Attention Is All You Need》2017-06-12:https://arxiv.org/abs/1706.03762
《Sequence to Sequence Learning with Neural Networks》2014-09-10:https://arxiv.org/abs/1409.3215
优质博客:
《Understanding LSTM Networks》-Colah
综述博客:《BLSTM-RNN、Deep Voice、Tacotron…你都掌握了吗?一文总结语音合成必备经典模型(一)》:https://new.qq.com/rain/a/20221204A02GIT00
2019 深度学习语音合成指南:https://www.bilibili.com/read/cv3532474/
语音合成方法、模型训练方法、设备及存储介质与流程:https://www.xjishu.com/zhuanli/21/202111674186.html