mirror of
https://github.com/PierrunoYT/Kokoro-TTS-Local.git
synced 2025-01-27 02:30:25 +03:00
6745624ca0c5bf5fa1c93052348d50d2c2182296
Kokoro TTS
⚠️ CURRENT STATUS: NOT WORKING - HELP NEEDED ⚠️
This project is currently experiencing issues with the espeak-ng integration and phonemizer library. We're looking for contributors who can help fix these problems.
A local implementation of the Kokoro Text-to-Speech system, based on the Kokoro-82M model.
Known Issues
- Phonemizer library fails to detect espeak-ng even when properly installed
- Integration between Python phonemizer and espeak-ng needs fixing
- Model loading process needs debugging
Features (When Working)
- High-quality English text-to-speech synthesis
- Multiple voice styles
- Adjustable speech speed
- Local inference without internet dependency (after initial model download)
Prerequisites
- Python 3.8 or higher
- espeak-ng (text-to-speech engine)
- Git LFS (for model download)
Installation
-
Install espeak-ng:
- Download from espeak-ng releases
- Install for all users
- Ensure espeak-ng-data directory is properly set up
-
Set up Python environment:
# Create and activate virtual environment
python -m venv venv
.\venv\Scripts\activate # Windows
source venv/bin/activate # Linux/Mac
# Install dependencies
pip install -r requirements.txt
Project Structure
tts_demo.py- Demo script showing basic usagemodels.py- Model implementation and utilitiesrequirements.txt- Python dependencies
How to Contribute
If you have experience with:
- Python text-to-speech systems
- espeak-ng integration
- phonemizer library
- PyTorch model deployment
Please consider contributing to fix the current issues. You can:
- Fork the repository
- Create a branch for your fixes
- Submit a pull request
Credits
- Original model: hexgrad/Kokoro-82M
- Based on papers:
License
Apache-2.0 License (following the original model's license)
Description
Languages
Python
92%
Shell
4.2%
PowerShell
3.8%