PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
A Non-Autoregressive Transformer-based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modeling. This project grows with the research community, aiming to achieve the ultimate TTS.
Official implementation of Meta-StyleSpeech and StyleSpeech
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav) system, supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
A cross-platform inference engine for neural TTS models.
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.
Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. Phonemization uses an ONNX Runtime port of the DeepPhonemizer model; speech synthesis uses VITS models. Piper models are compatible after running a conversion script (see the pipeline sketch after this list).
This is a template for a non-autoregressive deep-learning-based TTS model (in PyTorch).
A ComfyUI custom node integration providing multi-language, high-quality Text-to-Speech and Voice Conversion nodes using multiple engines (RVC, ResembleAI's Chatterbox TTS, F5-TTS, and Higgs Audio 2), with unlimited text length, SRT timing, character support, an Audio Analyzer, a Silent Speech Analyzer, audio editing, and more!
Let your GNOME desktop speak to you. Reads your desktop notifications or selected text out loud with a human-like voice using Piper. Uses a local LLM to summarize selected text.
VS Code extension for multi-language text translation and TTS (text-to-speech) using Azure Cognitive Services. Please [✩Star] if you're using it!
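Several of the projects above (Babylon.cpp, the Piper-based desktop reader, the VITS-based nodes) follow the same two-stage pattern: a grapheme-to-phoneme front end maps text to phoneme IDs, and an acoustic model turns those IDs into a waveform in one pass. Below is a minimal, hedged Python sketch of that flow using ONNX Runtime. The model path, phoneme inventory, and tensor names are illustrative assumptions, not the actual interfaces of any repository listed here; check your exported model's input metadata before reusing it.

```python
# Minimal sketch of a two-stage neural TTS pipeline:
# grapheme-to-phoneme conversion followed by waveform synthesis.
# Model path, phoneme inventory, and ONNX tensor names are placeholders.

import numpy as np
import onnxruntime as ort

# Hypothetical phoneme inventory; a real G2P model (e.g. a DeepPhonemizer
# export) would produce IPA or ARPAbet symbols from input text.
PHONEME_TO_ID = {" ": 0, "h": 1, "ə": 2, "l": 3, "oʊ": 4}


def phonemes_to_ids(phonemes):
    """Map a phoneme sequence to a batch of integer IDs for the acoustic model."""
    return np.array([[PHONEME_TO_ID[p] for p in phonemes]], dtype=np.int64)


def synthesize(phonemes, model_path="vits_model.onnx"):
    """Run a VITS-style ONNX acoustic model end-to-end (phoneme IDs -> waveform)."""
    sess = ort.InferenceSession(model_path)
    ids = phonemes_to_ids(phonemes)
    lengths = np.array([ids.shape[1]], dtype=np.int64)
    # Input names below ("input", "input_lengths", "scales") are assumptions;
    # inspect sess.get_inputs() for the names your exported model actually uses.
    inputs = {
        "input": ids,
        "input_lengths": lengths,
        # noise_scale, length_scale, noise_w (typical VITS inference controls)
        "scales": np.array([0.667, 1.0, 0.8], dtype=np.float32),
    }
    audio = sess.run(None, inputs)[0]
    return audio.squeeze()


if __name__ == "__main__":
    wav = synthesize(["h", "ə", "l", "oʊ"])
    print("Generated", wav.shape, "samples")
```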