What's Changed
Features
- Speaker caching for cloned voices by @eginhard in #438
- For usage details see https://coqui-tts.readthedocs.io/en/latest/cloning.html
⚠️ The old caching mechanism of Bark and Tortoise has been removed, switch to the new one instead.
- Provide `synthesize()` method with a common interface for every TTS model by @eginhard in #453
⚠️ Deprecate `speaker_id` argument of `synthesize()`, use `speaker` instead.
⚠️ Deprecate `config` argument of `synthesize()`, it can safely be left out.
- Add OpenAI-compatible endpoint to the server by @teddybear082 in #421
See https://coqui-tts.readthedocs.io/en/latest/server.html#openai-compatible-endpoint
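
As a rough illustration of what a client for the OpenAI-compatible endpoint might look like, the sketch below builds an OpenAI-style speech request using only the Python standard library. Several details are assumptions not stated in these notes: the server address `http://localhost:5002`, the `/v1/audio/speech` route and the `model`/`input`/`voice` JSON fields (borrowed from OpenAI's own API), and the placeholder values `tts-1` and `example-voice`. The helper names `build_speech_request` and `save_speech` are hypothetical. Check the linked server documentation for the actual route and accepted values.

```python
import json
import urllib.request

# Assumptions (not confirmed by these release notes): the server listens on
# localhost:5002 and mirrors OpenAI's /v1/audio/speech route and JSON fields.
BASE_URL = "http://localhost:5002"


def build_speech_request(text, voice, model="tts-1", base_url=BASE_URL):
    """Build (but do not send) an OpenAI-style text-to-speech POST request.

    Hypothetical helper: "tts-1" is only a placeholder model name.
    """
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def save_speech(request, path):
    """Send the request and write the returned audio bytes to a file."""
    with urllib.request.urlopen(request) as response, open(path, "wb") as out:
        out.write(response.read())


# "example-voice" is a placeholder; use a speaker name your server knows.
request = build_speech_request("Hello world!", voice="example-voice")

# Uncomment with a server running; choose a file extension that matches
# the audio format the server actually returns.
# save_speech(request, "hello_output")
```

Building the request separately from sending it makes it easy to inspect the URL and payload before pointing the client at a live server.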
Fixes
- Update coqui-tts-trainer to 0.3.0 to fix numerous training-related bugs by @eginhard in #423
For the full list of fixes see https://github.com/idiap/coqui-ai-Trainer/releases/tag/v0.3.0
- fix(configs): add default padding character by @eginhard in #425
- Fix KeyError when speaker_id is empty string in TTS server by @mehulanshumali in #436
- Fix: Update xtts finetuning Colab to support Gradio 5 by @eulphean in #408
- Bring back compatibility with numpy1 by @MarwanMashra in #413
- fix: update XTTS/Tortoise GPT code for HF transformers 4.52+ by @eginhard in #414
- build: lower minimum pytorch version back to 2.1 by @eginhard in #432
- refactor(phonemizer): replace mecab-python3 with fugashi for japanese by @eginhard in #417
- docs: add Docker Compose config by @KishoOoOoOo in #411
New Contributors
- @KishoOoOoOo made their first contribution in #411
- @eulphean made their first contribution in #408
- @MarwanMashra made their first contribution in #413
- @teddybear082 made their first contribution in #421
- @mehulanshumali made their first contribution in #436
Full Changelog: v0.26.2...v0.27.0