Projects with this topic
-
-
https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning
Updated -
https://github.com/Picovoice/web-voice-processor A library for real-time voice processing in web browsers.
Updated -
🔧 🔗 https://github.com/SYSTRAN/faster-whisperFaster Whisper transcription with CTranslate2
Updated -
-
[
🔧 🔗 https://github.com/alphacep/vosk-api](https://github.com/alphacep/vosk-apiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Updated -
🔧 🔗 https://github.com/alphacep/awesome-russian-speechRussian speech technology links
Updated -
🔧 🔗 https://github.com/m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)Updated -
🔧 🔗 https://github.com/gpustack/vox-boxA text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
Updated -
🔧 🔗 https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.Updated -
🔧 🔗 https://github.com/speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.Updated -
🔧 🔗 https://github.com/huggingface/speech-to-speechSpeech To Speech: an effort for an open-sourced and modular GPT4-o
Updated -
🔧 🔗 https://github.com/modelscope/FunCodecFunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
Updated -
https://github.com/coqui-ai/STT
🐸 STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.Updated -
https://github.com/coqui-ai/STT-models Open models for Coqui STT coqui.ai
Updated -
https://github.com/coqui-ai/STT-examples
🐸 STT integration examples github.com/coqui-ai/STTUpdated -
https://github.com/coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech TechnologiesUpdated -
https://github.com/Picovoice/leopard
On-device speech-to-text engine powered by deep learning
Updated