Projects with this topic
Sort by:
-
pyannote audio
🔧 🔗 https://github.com/pyannote/pyannote-audio Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker emUpdated -
🔧 🔗 https://github.com/modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processingUpdated -
https://github.com/Picovoice/cobra On-device voice activity detection (VAD) powered by deep learning
Updated -
https://github.com/coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech TechnologiesUpdated