A high-performance speech recognition MCP server based on Faster Whisper, providing efficient audio transcription capabilities.
Updated 2025-10-27 23:17:52 +03:00
AI-powered CLI that translates natural language into safe, reviewable ffmpeg commands.
Updated 2025-10-09 13:42:56 +03:00
plug whisper audio transcription to a local ollama server and ouput tts audio responses
Updated 2024-04-20 16:48:11 +03:00
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, vid
Updated 2023-05-24 23:26:06 +03:00
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Updated 2023-05-08 12:41:45 +03:00