Commit Graph

20 Commits

Author SHA1 Message Date
Pierre Bruno
f7753ccb62 Enhance Gradio interface and audio conversion capabilities
- Added audio format conversion functionality using pydub, supporting WAV, MP3, and AAC formats.
- Improved error handling for voice directory access and audio conversion processes.
- Updated README to reflect new web interface features and installation requirements, including FFmpeg.
- Enhanced the TTS generation function to utilize the correct Python interpreter across platforms.
- Documented new features in the README, including real-time progress monitoring and network sharing capabilities.
2025-01-16 16:19:31 +01:00
Pierre Bruno
49e19f0c51 Merge remote-tracking branch 'teslanaut/feature/gradio-interface' into gradio-ui 2025-01-16 16:07:54 +01:00
Pierre Bruno
1e875aba99 Enhance TTS demo with voice validation and progress indicators
- Introduced load_and_validate_voice function to ensure requested voice exists before loading.
- Added command-line options for model path, output file, and language code with default values.
- Implemented progress indicators using tqdm for model and voice loading, as well as speech generation.
- Updated default text handling and ensured proper cleanup of resources after execution.
2025-01-15 18:22:54 +01:00
Pip
1e61270e6c Added Gradio web UI & updated models.py to work with the UI. Tested on Linux Mint & MacOS Sequoia, not tested on Windows 2025-01-15 09:14:18 -08:00
Pierre Bruno
26ee1128a5 Add config.json download in build_model function 2025-01-15 09:18:46 +01:00
Pierre Bruno
34809a8073 Update README with cross-platform setup instructions and scripts 2025-01-14 16:20:48 +01:00
Pierre Bruno
36c5c7f85d Update README with comprehensive technical details and current status 2025-01-14 16:19:15 +01:00
Pierre Bruno
2871b2b47d Update README with voice listing feature and improved documentation 2025-01-14 16:18:10 +01:00
Pierre Bruno
c6c4b0b39d Add voice listing functionality
- Add list_available_voices() function to models.py
- Add --list-voices argument to tts_demo.py
- Enable users to view all available voice options
2025-01-14 16:14:48 +01:00
Pierre Bruno
829df3f1ba Add Apache 2.0 LICENSE with copyright notice 2025-01-14 16:08:22 +01:00
Pierre Bruno
79f2285a6a Add interactive CLI and command-line options for custom text input 2025-01-14 15:59:37 +01:00
Pierre Bruno
7d32fb6543 Update README with current project status and improved documentation 2025-01-14 15:58:05 +01:00
Pierre Bruno
2d81e92b33 Fix: Add plbert module dependency 2025-01-14 15:56:27 +01:00
Pierre Bruno
9ab9ad1f59 Fix: Reorder module imports to handle dependencies correctly 2025-01-14 15:55:42 +01:00
Pierre Bruno
40ce7ecb4f Fix: Resolve circular imports and improve module loading 2025-01-14 15:54:54 +01:00
Pierre Bruno
f0e8343a7d Update to use espeakng-loader and phonemizer-fork 2025-01-14 15:48:44 +01:00
Pierre Bruno
43fe839629 Update to use official Kokoro implementation 2025-01-14 15:46:50 +01:00
Pierre Bruno
6f38c34998 Update to use espeakng-loader for automatic espeak-ng installation 2025-01-14 15:43:16 +01:00
Pierre Bruno
6745624ca0 Update README: Add current status and call for help 2025-01-14 15:39:25 +01:00
Pierre Bruno
9eb71b699d Initial commit: Kokoro TTS Local implementation 2025-01-14 15:38:03 +01:00