S
speech-to-text

Projects with this topic

View Speech To Speech project

mirrored_repos / MachineLearning / huggingface / Speech To Speech

🔧🔗https://github.com/huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python Machine Lear... Synthetic In... speech speech-synth... assistant speech-to-text language-model speech-trans...

0

Updated Jun 12, 2026

0 0 0 0

Updated Jun 12, 2026
View FunClip project

mirrored_repos / MachineLearning / modelscope / FunClip

🔧🔗https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recog... speech-to-text gradio video-clip subtitles-ge... Large Langua... Python modelscope

0

Updated Jun 12, 2026

0 0 0 0

Updated Jun 12, 2026
View Cheetah project

mirrored_repos / MachineLearning / Picovoice / Cheetah

https://github.com/Picovoice/cheetah On-device streaming speech-to-text engine powered by deep learning

voice-activa... speech-recog... automatic-sp... speech-to-text transcription stt asr online-speec... streaming-sp...

0

Updated Jun 10, 2026

0 0 0 0

Updated Jun 10, 2026
View Leon project

mirrored_repos / MachineLearning / leon-ai / Leon

🔧🔗https://github.com/leon-ai/leon

🧠 Leon is your open-source personal assistant.

🔗https://getleon.ai

NodeJS Python bot tts automation privacy offline chatbot Synthetic In... speech-synth... ai-assistant assistant speech-recog... personal-ass... speech-to-text leon flite voice-assistant virtual-assi...

0

Updated Jun 07, 2026

0 0 0 0

Updated Jun 07, 2026
View Vosk Api project

mirrored_repos / MachineLearning / alphacep / Vosk Api

[🔧🔗https://github.com/alphacep/vosk-api](https://github.com/alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

vosk speaker-iden... deepspeech speech-to-text Python android raspberry-pi ios privacy deep-neural-... Deep Learning offline voice-recogn... speech-recog... kaldi stt speaker-veri... asr

0

Updated Jun 04, 2026

0 0 0 0

Updated Jun 04, 2026
View WhisperX project

mirrored_repos / MachineLearning / m-bain / WhisperX

🔧🔗https://github.com/m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recog... speech-to-text whisper asr

0

Updated Jun 03, 2026

0 0 0 0

Updated Jun 03, 2026
View Speechbrain project

mirrored_repos / MachineLearning / speechbrain / Speechbrain

🔧🔗https://github.com/speechbrain/speechbrain A PyTorch-based Speech Toolkit

audio speechbrain Deep Learning transformers Python pytorch voice-recogn... speech-recog... speech-to-text Large Langua... speaker-reco... speaker-veri... speech-proce... audio-proces... asr speaker-diar... speech-separ... speech-enhan... spoken-langu... speech-toolkit

0

Updated May 27, 2026

0 0 0 0

Updated May 27, 2026
View Vox Box project

mirrored_repos / MachineLearning / GPUStack / Vox Box

🔧🔗https://github.com/gpustack/vox-box

A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

Python tts speech-to-text stt audio-proces... asr openai-api

0

Updated Dec 23, 2025

0 0 0 0

Updated Dec 23, 2025
View Faster Whisper project

mirrored_repos / MachineLearning / SYSTRAN / Faster Whisper

🔧🔗https://github.com/SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

Deep Learning inference transformer OpenAI quantization whisper speech-recog... speech-to-text

0

Updated Nov 19, 2025

0 0 0 0

Updated Nov 19, 2025
View FunCodec project

mirrored_repos / MachineLearning / modelscope / FunCodec

🔧🔗https://github.com/modelscope/FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

tts speech-synth... codec speech-to-text audio-genera... encodec voice-cloning audio-quanti... modelscope

0

Updated Dec 08, 2024

0 0 0 0

Updated Dec 08, 2024
View STT project

mirrored_repos / MachineLearning / coqui-ai / STT

https://github.com/coqui-ai/STT 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

Deep Learning tensorflow voice-recogn... speech-recog... automatic-sp... speech-to-text stt asr speech-recog... speech-recog...

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024
View STT Examples project

mirrored_repos / MachineLearning / coqui-ai / STT Examples

https://github.com/coqui-ai/STT-examples 🐸STT integration examples github.com/coqui-ai/STT

stt speech-to-text examples

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾