speech-recognition

Projects with this topic

mirrored_repos / MachineLearning / xinjli / Ucla Phonetic Corpus

UCLA Phonetic Corpus
🔧🔗https://github.com/xinjli/ucla-phonetic-corpus

Dataset for the ICASSP 2021 paper "Multilingual Phonetic Dataset for Low Resource Speech Recognition"

speech-recog... speech dataset phonetics

Updated Sep 02, 2025

mirrored_repos / MachineLearning / xinjli / Allosaurus

Allosaurus
🔧🔗https://github.com/xinjli/allosaurus Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

speech pytorch speech-recog... phonetics

Updated Sep 02, 2025

mirrored_repos / MachineLearning / alphacep / Vosk Server

🔧🔗https://github.com/alphacep/vosk-server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Python websocket webrtc grpc saas speech-recog... kaldi asr vosk server

Updated Jul 25, 2025

mirrored_repos / MachineLearning / modelscope / FunClip

🔧🔗https://github.com/modelscope/FunClip Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.

speech-recog... speech-to-text gradio video-clip subtitles-ge... Large Langua... Python modelscope

Updated Jul 11, 2025

mirrored_repos / MachineLearning / speechbrain / Speechbrain.Github.Io

🔧🔗https://github.com/speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for tasks such as speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Deep Learning neural-network speech speech-recog... speech-to-text speaker-reco... speaker-veri... speech-proce... speech-recog... beamforming speech-analysis speech-recog... speech-separ... speech-emoti... speaker-iden... Documentation website speechbrain

Updated Jun 18, 2025

mirrored_repos / MachineLearning / homebrewltd / AudioBench

https://github.com/homebrewltd/AudioBench AudioBench: A Universal Benchmark for Audio Large Language Models 🔗 https://arxiv.org/abs/2406.16020

speech speech-recog... speech-quest... audio-scene-...

Updated Jan 16, 2025

mirrored_repos / MachineLearning / markovka17 / Digit Recognition

🔧🔗https://github.com/markovka17/digit-recognition A small model for recognition of digits in audio clips

pytorch speech-recog... jasper ctc

Updated Dec 04, 2024

mirrored_repos / MachineLearning / Cinnamon / Whisper Jargon

🔧🔗https://github.com/Cinnamon/whisper-jargon

[SIGDIAL'24] Improving Speech Recognition with Jargon Injection

🕸️🔗https://aclanthology.org/2024.sigdial-1.42/

speech-recog... Python whisper sigdial Synthetic In... academic

Updated Oct 23, 2024

mirrored_repos / MachineLearning / YuanGongND / Ltu

https://github.com/YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

audio Deep Learning speech-recog... audio-proces... Large Langua...

Updated Aug 17, 2024

mirrored_repos / MachineLearning / coqui-ai / STT

https://github.com/coqui-ai/STT 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Deep Learning tensorflow voice-recogn... speech-recog... automatic-sp... speech-to-text stt asr speech-recog... speech-recog...

Updated Jun 09, 2024

mirrored_repos / MachineLearning / coqui-ai / Stt Model Manager

https://github.com/coqui-ai/stt-model-manager Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo

react Python flask websocket speech-recog... stt coqui-ai

Updated Jun 09, 2024

mirrored_repos / MachineLearning / coqui-ai / Open Speech Corpora

https://github.com/coqui-ai/open-speech-corpora 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

voice-recogn... speech-recog... speech-to-text stt speech-proce... voice-activi... speech-separ... speech-emoti... voice-cloning tts speech-synth...

Updated Jun 09, 2024

mirrored_repos / MachineLearning / neonbjb / Ocotillo

https://github.com/neonbjb/ocotillo Performant and accurate speech recognition built on PyTorch

speech-recog... pytorch wav2vec2

Updated May 31, 2024

mirrored_repos / MachineLearning / Picovoice / Leopard

https://github.com/Picovoice/leopard

On-device speech-to-text engine powered by deep learning

voice-recogn... speech-recog... automatic-sp... speech-to-text transcription stt asr voice-to-text on-device

Updated May 13, 2024
