speech-recognition
Projects with this topic
https://github.com/modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing
huggingface.co/transformers https://github.com/huggingface/transformers 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
https://github.com/Picovoice/porcupine On-device wake word detection powered by deep learning
https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning
https://github.com/Picovoice/rhino On-device Speech-to-Intent engine powered by deep learning
https://github.com/homebrewltd/AudioBench AudioBench: A Universal Benchmark for Audio Large Language Models
https://arxiv.org/abs/2406.16020
https://github.com/bytedance/SALMONN SALMONN: Speech Audio Language Music Open Neural Network
https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.
https://github.com/Picovoice/falcon On-device speaker diarization powered by deep learning
https://github.com/markovka17/digit-recognition A small model for recognition of digits in audio clips
https://github.com/leon-ai/leon 🧠 Leon is your open-source personal assistant.
https://github.com/Picovoice/cobra On-device voice activity detection (VAD) powered by deep learning
https://github.com/Cinnamon/whisper-jargon [SIGDIAL'24] Improving Speech Recognition with Jargon Injection
https://github.com/Picovoice/web-voice-processor A library for real-time voice processing in web browsers.
https://github.com/YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
https://github.com/coqui-ai/STT 🐸 STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
https://github.com/coqui-ai/stt-model-manager Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
https://github.com/coqui-ai/open-speech-corpora 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
https://github.com/neonbjb/ocotillo Performant and accurate speech recognition built on PyTorch