Skip to content

Conversation

ivanfioravanti
Copy link
Collaborator

##Context
Only English was supported for Kokoro and all other models were missing from web interface.

##Description

  • Added support for all kokoro languages and voices (Language selection added to html page and passed to server. All languages in https://huggingface.co/hexgrad/Kokoro-82M/blob/main/VOICES.md added.)
  • Added support for CSM/Sesame
  • Updated TTS endpoint to handle speed, pitch, and gender parameters for Spark models.
  • Modified audio player UI to dynamically load model types, languages, and voices based on selected model capabilities.
  • Added support for Spark-specific controls including speech speed, pitch, and gender selection.
  • Improved JavaScript logic for fetching models and updating UI elements accordingly.

Closes #30 and #77

ivanfioravanti and others added 4 commits August 17, 2025 15:06
- Added /models endpoint to list available TTS models with configurations
- Extended model dropdown to include Kokoro, CSM/Sesame, Bark, OuteTTS, and Spark
- Added model-specific UI elements (reference audio upload for voice cloning)
- Updated JavaScript for dynamic UI changes based on selected model capabilities
- Enhanced server TTS endpoint to handle model-specific parameters
- Support for reference audio in CSM/Sesame models for voice cloning
- Dynamic language/voice options based on model capabilities
…eters

- Added support for all kokoro languages and voices
- Added support for CSM/Sesame
- Updated TTS endpoint to handle speed, pitch, and gender parameters for Spark models.
- Modified audio player UI to dynamically load model types, languages, and voices based on selected model capabilities.
- Added support for Spark-specific controls including speech speed, pitch, and gender selection.
- Improved JavaScript logic for fetching models and updating UI elements accordingly.
@@ -390,6 +487,89 @@ def stop_audio():
)


@app.get("/languages")
Copy link
Owner

@Blaizzy Blaizzy Aug 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good!
However, the approach will change soon. Server won't have languages or models endpoints to simplify it because there are model specific.

We can return directly from the model config or object.

See these:

Copy link
Owner

@Blaizzy Blaizzy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@Blaizzy Blaizzy merged commit 63aa585 into Blaizzy:main Aug 17, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Support for Italian language ?
2 participants