S
speech-to-text

Projects with this topic

View Speech To Speech project

mirrored_repos / MachineLearning / huggingface / Speech To Speech

🔧🔗https://github.com/huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python Machine Lear... Synthetic In... speech speech-synth... assistant speech-to-text language-model speech-trans...

0

Updated Jun 12, 2026

0 0 0 0

Updated Jun 12, 2026
View FunClip project

mirrored_repos / MachineLearning / modelscope / FunClip

🔧🔗https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recog... speech-to-text gradio video-clip subtitles-ge... Large Langua... Python modelscope

0

Updated Jun 12, 2026

0 0 0 0

Updated Jun 12, 2026
View Cheetah project

mirrored_repos / MachineLearning / Picovoice / Cheetah

https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning

voice-activa... speech-recog... automatic-sp... speech-to-text transcription stt asr online-speec... streaming-sp...

0

Updated Jun 10, 2026

0 0 0 0

Updated Jun 10, 2026
View Web Voice Processor project

mirrored_repos / MachineLearning / Picovoice / Web Voice Processor

https://github.com/Picovoice/web-voice-processor A library for real-time voice processing in web browsers.

javascript real-time browser worker realtime voice-commands microphone speech-recog... webaudio-api pcm web-browser speech-to-text audio-proces... wake-word-de... downsampling voice-proces...

0

Updated Jun 10, 2026

0 0 0 0

Updated Jun 10, 2026
View Leon project

mirrored_repos / MachineLearning / leon-ai / Leon

🔧🔗https://github.com/leon-ai/leon

🧠 Leon is your open-source personal assistant.

🔗https://getleon.ai

NodeJS Python bot tts automation privacy offline chatbot Synthetic In... speech-synth... ai-assistant assistant speech-recog... personal-ass... speech-to-text leon flite voice-assistant virtual-assi...

0

Updated Jun 07, 2026

0 0 0 0

Updated Jun 07, 2026
View Vosk Api project

mirrored_repos / MachineLearning / alphacep / Vosk Api

[🔧🔗https://github.com/alphacep/vosk-api](https://github.com/alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

vosk speaker-iden... deepspeech speech-to-text Python android raspberry-pi ios privacy deep-neural-... Deep Learning offline voice-recogn... speech-recog... kaldi stt speaker-veri... asr

0

Updated Jun 04, 2026

0 0 0 0

Updated Jun 04, 2026
View WhisperX project

mirrored_repos / MachineLearning / m-bain / WhisperX

🔧🔗https://github.com/m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recog... speech-to-text whisper asr

0

Updated Jun 03, 2026

0 0 0 0

Updated Jun 03, 2026
View Speechbrain project

mirrored_repos / MachineLearning / speechbrain / Speechbrain

🔧🔗https://github.com/speechbrain/speechbrain A PyTorch-based Speech Toolkit

audio speechbrain Deep Learning transformers Python pytorch voice-recogn... speech-recog... speech-to-text Large Langua... speaker-reco... speaker-veri... speech-proce... audio-proces... asr speaker-diar... speech-separ... speech-enhan... spoken-langu... speech-toolkit

0

Updated May 27, 2026

0 0 0 0

Updated May 27, 2026
View Awesome Russian Speech project

mirrored_repos / MachineLearning / alphacep / Awesome Russian Speech

🔧🔗https://github.com/alphacep/awesome-russian-speech

Russian speech technology links

tts awesome-list speech-synth... speech-recog... speech-to-text asr vosk russian

0

Updated Mar 24, 2026

0 0 0 0

Updated Mar 24, 2026
View Vox Box project

mirrored_repos / MachineLearning / GPUStack / Vox Box

🔧🔗https://github.com/gpustack/vox-box

A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

Python tts speech-to-text stt audio-proces... asr openai-api

0

Updated Dec 23, 2025

0 0 0 0

Updated Dec 23, 2025
View Faster Whisper project

mirrored_repos / MachineLearning / SYSTRAN / Faster Whisper

🔧🔗https://github.com/SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

Deep Learning inference transformer OpenAI quantization whisper speech-recog... speech-to-text

0

Updated Nov 19, 2025

0 0 0 0

Updated Nov 19, 2025
View Speechbrain.Github.Io project

mirrored_repos / MachineLearning / speechbrain / Speechbrain.Github.Io

🔧🔗https://github.com/speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Deep Learning neural-network speech speech-recog... speech-to-text speaker-reco... speaker-veri... speech-proce... speech-recog... beamforming speech-analysis speech-recog... speech-separ... speech-emoti... speaker-iden... Documentation website speechbrain

0

Updated Jun 18, 2025

0 0 0 0

Updated Jun 18, 2025
View FunCodec project

mirrored_repos / MachineLearning / modelscope / FunCodec

🔧🔗https://github.com/modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

tts speech-synth... codec speech-to-text audio-genera... encodec voice-cloning audio-quanti... modelscope

0

Updated Dec 08, 2024

0 0 0 0

Updated Dec 08, 2024
View STT project

mirrored_repos / MachineLearning / coqui-ai / STT

https://github.com/coqui-ai/STT 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Deep Learning tensorflow voice-recogn... speech-recog... automatic-sp... speech-to-text stt asr speech-recog... speech-recog...

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024
View STT Models project

mirrored_repos / MachineLearning / coqui-ai / STT Models

https://github.com/coqui-ai/STT-models Open models for Coqui STT coqui.ai

Deep Learning Large Langua... pretrained-m... language-model models speech-to-text stt

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024
View STT Examples project

mirrored_repos / MachineLearning / coqui-ai / STT Examples

https://github.com/coqui-ai/STT-examples 🐸STT integration examples github.com/coqui-ai/STT

stt speech-to-text examples

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024
View Open Speech Corpora project

mirrored_repos / MachineLearning / coqui-ai / Open Speech Corpora

https://github.com/coqui-ai/open-speech-corpora 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

voice-recogn... speech-recog... speech-to-text stt speech-proce... voice-activi... speech-separ... speech-emoti... voice-cloning tts speech-synth...

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024
View Leopard project

mirrored_repos / MachineLearning / Picovoice / Leopard

https://github.com/Picovoice/leopard

On-device speech-to-text engine powered by deep learning

voice-recogn... speech-recog... automatic-sp... speech-to-text transcription stt asr voice-to-text on-device

0

Updated May 13, 2024

0 0 0 0

Updated May 13, 2024

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾