pranay-j's Collections
Text to Speech Architectures
FastPitch: Parallel Text-to-speech with Pitch Prediction
Paper • 2006.06873
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Paper • 2010.05646
Tacotron: Towards End-to-End Speech Synthesis
Paper • 1703.10135
Parallel Tacotron: Non-Autoregressive and Controllable TTS
Paper • 2010.11439
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis
Paper • 2005.05957
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
Paper • 2106.07889
WaveGlow: A Flow-based Generative Network for Speech Synthesis
Paper • 1811.00002
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Paper • 2110.07205
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Paper • 2403.03100
WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration
Paper • 2210.01029