S
speech-recognition

Projects with this topic

View Transformers project

mirrored_repos / MachineLearning / huggingface / Transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
huggingface.co/transformers https://github.com/huggingface/transformers

Python nlp Machine Lear... natural-lang... Deep Learning tensorflow pytorch transformer speech-recog... seq2seq flax pretrained-m... language-model Large Langua... nlp-library bert jax pytorch-tran... model-hub

0

Updated Jul 13, 2026

0 0 0 0

Updated Jul 13, 2026
View FunASR project

mirrored_repos / MachineLearning / modelscope / FunASR

🔧🔗https://github.com/modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing

pytorch Python modelscope speech-recog... vad punctuation whisper audio-visual... speaker-diar... voice-activi... conformer pretrained-m... rnnt dfsmn paraformer speechgpt speechllm

0

Updated Jul 13, 2026

0 0 0 0

Updated Jul 13, 2026
View Leon project

mirrored_repos / MachineLearning / leon-ai / Leon

🔧🔗https://github.com/leon-ai/leon

🧠 Leon is your open-source personal assistant.

🔗https://getleon.ai

NodeJS Python bot tts automation privacy offline chatbot Synthetic In... speech-synth... ai-assistant assistant speech-recog... personal-ass... speech-to-text leon flite voice-assistant virtual-assi...

0

Updated Jul 12, 2026

0 0 0 0

Updated Jul 12, 2026
View Rhino project

mirrored_repos / MachineLearning / Picovoice / Rhino

https://github.com/Picovoice/rhino On-device Speech-to-Intent engine powered by deep learning

entity-resol... nlu voice-commands voice-recogn... speech-recog... voice-control voice-command slot-filling voice-assistant natural-lang... slu vui on-device spoken-langu... voice-ui voice-user-i... intent-infer... voice-comman...

0

Updated Jul 10, 2026

0 0 0 0

Updated Jul 10, 2026
View Cheetah project

mirrored_repos / MachineLearning / Picovoice / Cheetah

https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning

voice-activa... speech-recog... automatic-sp... speech-to-text transcription stt asr online-speec... streaming-sp...

0

Updated Jul 10, 2026

0 0 0 0

Updated Jul 10, 2026
View Espnet project

mirrored_repos / MachineLearning / espnet / Espnet

ESPNet
🔧🔗https://github.com/espnet/espnet End-to-End Speech Processing Toolkit

text-to-speech Deep Learning chainer end-to-end machine-tran... pytorch Python speech-synth... speech-recog... kaldi voice-conver... speaker-diar... speech-separ... speech-enhan... spoken-langu... speech-trans... singing-voic... singing espnet

0

Updated Jul 10, 2026

0 0 0 0

Updated Jul 10, 2026
View FunClip project

mirrored_repos / MachineLearning / modelscope / FunClip

🔧🔗https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recog... speech-to-text gradio video-clip subtitles-ge... Large Langua... Python modelscope

0

Updated Jul 07, 2026

0 0 0 0

Updated Jul 07, 2026
View Porcupine project

mirrored_repos / MachineLearning / Picovoice / Porcupine

https://github.com/Picovoice/porcupine On-device wake word detection powered by deep learning

speech-recog... hotword-dete... keyword-spot... handsfree wake-word-de... on-device hotword-dete... hotword trigger-word... keyword-spotter wake-word voice-activa... wake-word-en...

0

Updated Jul 04, 2026

0 0 0 0

Updated Jul 04, 2026
View Cobra project

mirrored_repos / MachineLearning / Picovoice / Cobra

https://github.com/Picovoice/cobra On-device voice activity detection (VAD) powered by deep learning

speech-recog... vad voice-activi... on-device voice-activity voice-activi...

0

Updated Jul 03, 2026

0 0 0 0

Updated Jul 03, 2026
View Vosk Api project

mirrored_repos / MachineLearning / alphacep / Vosk Api

[🔧🔗https://github.com/alphacep/vosk-api](https://github.com/alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

vosk speaker-iden... deepspeech speech-to-text Python android raspberry-pi ios privacy deep-neural-... Deep Learning offline voice-recogn... speech-recog... kaldi stt speaker-veri... asr

0

Updated Jul 02, 2026

0 0 0 0

Updated Jul 02, 2026
View Falcon project

mirrored_repos / MachineLearning / Picovoice / Falcon

https://github.com/Picovoice/falcon On-device speaker diarization powered by deep learning

on-device diarization speech-recog...

0

Updated Jul 02, 2026

0 0 0 0

Updated Jul 02, 2026
View WhisperX project

mirrored_repos / MachineLearning / m-bain / WhisperX

🔧🔗https://github.com/m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recog... speech-to-text whisper asr

0

Updated Jun 26, 2026

0 0 0 0

Updated Jun 26, 2026
View Web Voice Processor project

mirrored_repos / MachineLearning / Picovoice / Web Voice Processor

https://github.com/Picovoice/web-voice-processor A library for real-time voice processing in web browsers.

javascript real-time browser worker realtime voice-commands microphone speech-recog... webaudio-api pcm web-browser speech-to-text audio-proces... wake-word-de... downsampling voice-proces...

0

Updated Jun 23, 2026

0 0 0 0

Updated Jun 23, 2026
View Speechbrain project

mirrored_repos / MachineLearning / speechbrain / Speechbrain

🔧🔗https://github.com/speechbrain/speechbrain A PyTorch-based Speech Toolkit

audio speechbrain Deep Learning transformers Python pytorch voice-recogn... speech-recog... speech-to-text Large Langua... speaker-reco... speaker-veri... speech-proce... audio-proces... asr speaker-diar... speech-separ... speech-enhan... spoken-langu... speech-toolkit

0

Updated Jun 15, 2026

0 0 0 0

Updated Jun 15, 2026
View SALMONN project

mirrored_repos / MachineLearning / bytedance / SALMONN

🔧🔗https://github.com/bytedance/SALMONN SALMONN: Speech Audio Language Music Open Neural Network

audio music research speech speech-recog... multi-model audio-proces... tsinghua-uni... bytedance Large Langua... iclr2024 icml-2024

0

Updated May 26, 2026

0 0 0 0

Updated May 26, 2026
View Whisper project

mirrored_repos / MachineLearning / openai / Whisper

🔧🔗https://github.com/openai/whisper Robust Speech Recognition via Large-Scale Weak Supervision

speech-recog... OpenAI whisper stt tts

0

Updated Apr 15, 2026

0 0 0 0

Updated Apr 15, 2026
View Awesome Russian Speech project

mirrored_repos / MachineLearning / alphacep / Awesome Russian Speech

🔧🔗https://github.com/alphacep/awesome-russian-speech

Russian speech technology links

tts awesome-list speech-synth... speech-recog... speech-to-text asr vosk russian

0

Updated Mar 24, 2026

0 0 0 0

Updated Mar 24, 2026
View Vosk Unity Asr project

mirrored_repos / MachineLearning / alphacep / Vosk Unity Asr

🔧🔗https://github.com/alphacep/vosk-unity-asr Automatic Speech Recognition in Unity using Vosk library

unity3d speech-recog... stt asr deepspeech vosk cpp

0

Updated Mar 24, 2026

0 0 0 0

Updated Mar 24, 2026
View Dla project

mirrored_repos / MachineLearning / markovka17 / Dla

🔧🔗https://github.com/markovka17/dla

Deep learning for audio processing

Deep Learning signal-proce... tts speech-recog... speaker-veri...

0

Updated Dec 15, 2025

0 0 0 0

Updated Dec 15, 2025
View Faster Whisper project

mirrored_repos / MachineLearning / SYSTRAN / Faster Whisper

🔧🔗https://github.com/SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

Deep Learning inference transformer OpenAI quantization whisper speech-recog... speech-to-text

0

Updated Nov 19, 2025

0 0 0 0

Updated Nov 19, 2025

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾