alihan/Fast-Whisper-MCP-Server: A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities. - Fast-Whisper-MCP-Server - gitea-ailhan-registry

alihan/Fast-Whisper-MCP-Server

Go to file

Alihan 1292f0f09b Add GPU auto-reset, job queue, health monitoring, and test infrastructure

Major features:
- GPU auto-reset on CUDA errors with cooldown protection (handles sleep/wake)
- Async job queue system for long-running transcriptions
- Comprehensive GPU health monitoring with real model tests
- Phase 1 component testing with detailed logging

New modules:
- src/core/gpu_reset.py: GPU driver reset with 5-min cooldown
- src/core/gpu_health.py: Real GPU health checks using model inference
- src/core/job_queue.py: FIFO queue with background worker and persistence
- src/utils/test_audio_generator.py: Test audio generation for GPU checks
- test_phase1.py: Component tests with logging
- reset_gpu.sh: GPU driver reset script

Updates:
- CLAUDE.md: Added GPU auto-reset docs and passwordless sudo setup
- requirements.txt: Updated to PyTorch CUDA 12.4
- Model manager: Integrated GPU health check with reset
- Both servers: Added startup GPU validation with auto-reset
- Startup scripts: Added GPU_RESET_COOLDOWN_MINUTES env var

2025-10-09 23:13:11 +03:00

.github/workflows

Create python-app.yml

2025-03-22 13:40:58 +08:00

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

.gitignore

transcription flow cilalama, bugfixes

2025-06-15 17:50:05 +03:00

.python-version

feat: 初始化基于Faster Whisper的语音识别MCP服务器

2025-03-22 03:23:54 +08:00

CLAUDE.md

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

data

transcription flow cilalama, bugfixes

2025-06-15 17:50:05 +03:00

DEV_PLAN.md

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

Dockerfile

Refactor codebase structure with organized src/ directory

2025-10-07 12:28:03 +03:00

mcp.logs

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

pyproject.toml

Refactor codebase structure with organized src/ directory

2025-10-07 12:28:03 +03:00

requirements.txt

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

reset_gpu.sh

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

run_api_server.sh

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

run_mcp_server.sh

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

test_phase1.py

Add GPU auto-reset, job queue, health monitoring, and test infrastructure

2025-10-09 23:13:11 +03:00

uv.lock

refactor(whisper_server): 重构代码以模块化转录功能

2025-03-22 05:26:17 +08:00