alihan/Kokoro-TTS-Local

mirror of https://github.com/PierrunoYT/Kokoro-TTS-Local.git synced 2025-01-27 02:30:25 +03:00

Go to file

Pierre Bruno 7d32fb6543 Update README with current project status and improved documentation

2025-01-14 15:58:05 +01:00

.gitignore

Initial commit: Kokoro TTS Local implementation

2025-01-14 15:38:03 +01:00

models.py

Fix: Add plbert module dependency

2025-01-14 15:56:27 +01:00

README.md

Update README with current project status and improved documentation

2025-01-14 15:58:05 +01:00

requirements.txt

Update to use espeakng-loader and phonemizer-fork

2025-01-14 15:48:44 +01:00

tts_demo.py

Fix: Resolve circular imports and improve module loading

2025-01-14 15:54:54 +01:00

README.md

Kokoro TTS Local

A local implementation of the Kokoro Text-to-Speech model.

Current Status

⚠️ WORK IN PROGRESS ⚠️

The project is currently being updated to use better dependency management and improved module loading.

Features

Local text-to-speech synthesis using the Kokoro model
Automatic espeak-ng setup using espeakng-loader
Multiple voice support
Phoneme output support

Dependencies

torch
phonemizer-fork
transformers
scipy
munch
soundfile
huggingface-hub
espeakng-loader

Setup

Create a virtual environment:

python -m venv venv
.\venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Usage

Run the demo script:

python tts_demo.py

The script will:

Download necessary model files from Hugging Face
Set up espeak-ng automatically
Generate speech from the test text

Project Structure

models.py: Core model loading and speech generation functionality
tts_demo.py: Demo script showing basic usage
requirements.txt: Project dependencies

Model Information

The project uses the Kokoro-82M model from Hugging Face:

Repository: hexgrad/Kokoro-82M
Model file: kokoro-v0_19.pth
Voice files: Located in the voices/ directory

Contributing

Feel free to contribute by:

Opening issues for bugs or feature requests
Submitting pull requests with improvements
Helping with documentation

License

This project is licensed under the Apache 2.0 License.