Projects with this topic
-
🔧 🔗 https://github.com/modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processingUpdated -
https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning
Updated -
https://github.com/Picovoice/cobra On-device voice activity detection (VAD) powered by deep learning
Updated -
https://github.com/Picovoice/rhino On-device Speech-to-Intent engine powered by deep learning
Updated -
https://github.com/Picovoice/porcupine On-device wake word detection powered by deep learning
Updated -
https://github.com/Picovoice/web-voice-processor A library for real-time voice processing in web browsers.
Updated -
-
🔧 🔗 https://github.com/alphacep/vosk-serverWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Updated -
🔧 🔗 https://github.com/speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.Updated -
https://github.com/coqui-ai/stt-model-manager Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
Updated -
https://github.com/Picovoice/leopard
On-device speech-to-text engine powered by deep learning
Updated