Projects with this topic
-
https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning
Updated -
-
-
🔧 🔗 https://github.com/gpustack/vox-boxA text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
Updated -
[
🔧 🔗 https://github.com/alphacep/vosk-api](https://github.com/alphacep/vosk-apiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Updated -
🔧 🔗 https://github.com/SYSTRAN/faster-whisperFaster Whisper transcription with CTranslate2
Updated -
🔧 🔗 https://github.com/m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)Updated -
🔧 🔗 https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.Updated -
🔧 🔗 https://github.com/huggingface/speech-to-speechSpeech To Speech: an effort for an open-sourced and modular GPT4-o
Updated -
🔧 🔗 https://github.com/modelscope/FunCodecFunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Updated -
https://github.com/coqui-ai/STT
🐸 STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.Updated -
https://github.com/coqui-ai/STT-examples
🐸 STT integration examples github.com/coqui-ai/STTUpdated