mirror of
https://github.com/PierrunoYT/Kokoro-TTS-Local.git
synced 2025-01-27 02:30:25 +03:00
7d32fb654368eb84d390350674fcd68329fe23bd
Kokoro TTS Local
A local implementation of the Kokoro Text-to-Speech model.
Current Status
⚠️ WORK IN PROGRESS ⚠️
The project is currently being updated to use better dependency management and improved module loading.
Features
- Local text-to-speech synthesis using the Kokoro model
- Automatic espeak-ng setup using espeakng-loader
- Multiple voice support
- Phoneme output support
Dependencies
torch
phonemizer-fork
transformers
scipy
munch
soundfile
huggingface-hub
espeakng-loader
Setup
- Create a virtual environment:
python -m venv venv
.\venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
Usage
Run the demo script:
python tts_demo.py
The script will:
- Download necessary model files from Hugging Face
- Set up espeak-ng automatically
- Generate speech from the test text
Project Structure
models.py: Core model loading and speech generation functionalitytts_demo.py: Demo script showing basic usagerequirements.txt: Project dependencies
Model Information
The project uses the Kokoro-82M model from Hugging Face:
- Repository: hexgrad/Kokoro-82M
- Model file:
kokoro-v0_19.pth - Voice files: Located in the
voices/directory
Contributing
Feel free to contribute by:
- Opening issues for bugs or feature requests
- Submitting pull requests with improvements
- Helping with documentation
License
This project is licensed under the Apache 2.0 License.
Description
Languages
Python
92%
Shell
4.2%
PowerShell
3.8%