speech

Projects with this topic

mirrored_repos / MachineLearning / huggingface / Speech To Speech

🔧🔗https://github.com/huggingface/speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python Machine Lear... Synthetic In... speech speech-synth... assistant speech-to-text language-model speech-trans...

Updated Jun 09, 2026

mirrored_repos / MachineLearning / IAHispano / Applio

🔧🔗https://github.com/IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance

Synthetic In... voice speech pytorch tts voice-conver... voice-cloning speech-to-sp... voice-clone Applio rvc sts vits

Updated Jun 08, 2026

mirrored_repos / MachineLearning / alphacep / Vosk Tts

🔧🔗https://github.com/alphacep/vosk-tts

Text To Speech Synthesis with Vosk

text-to-speech tts vosk Python speech speech-synth...

Updated Jun 06, 2026

mirrored_repos / MachineLearning / pyannote / Pyannote Audio

pyannote audio
🔧🔗https://github.com/pyannote/pyannote-audio Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker em

speaker-reco... speaker-veri... speech-proce... voice-activi... overlapped-s... detection speech pytorch Python pretrained-m... speaker-diar... speech-activ... speaker-chan... stt speaker-embe... pyannote

Updated Jun 06, 2026

mirrored_repos / MachineLearning / m-bain / WhisperX

🔧🔗https://github.com/m-bain/whisperX WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recog... speech-to-text whisper asr

Updated Jun 03, 2026

mirrored_repos / MachineLearning / bytedance / SALMONN

🔧🔗https://github.com/bytedance/SALMONN SALMONN: Speech Audio Language Music Open Neural Network

audio music research speech speech-recog... multi-model audio-proces... tsinghua-uni... bytedance Large Langua... iclr2024 icml-2024

Updated May 26, 2026

mirrored_repos / festvox / Festvox

Festvox
🔧🔗https://github.com/festvox/festvox Festvox voice building tools

festival speech speech-synth... voice

Updated Mar 24, 2026

mirrored_repos / MachineLearning / modelscope / Modelscope

🔧🔗https://github.com/modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

nlp science speech computer-vision multi-modality Python Machine Lear... modelscope Deep Learning

Updated Dec 23, 2025

mirrored_repos / MachineLearning / espnet / Ctc Segmentation

CTC Segmentation
🔧🔗https://github.com/espnet/ctc-segmentation Segment an audio file and obtain utterance alignments. (Python package)

espnet Python utterances speech segmentation

Updated Sep 03, 2025

mirrored_repos / festvox / Flite

flite
🔧🔗https://github.com/festvox/flite

A small fast portable speech synthesis system

speech speech-synth... festvox festival cpp

Updated Sep 02, 2025

mirrored_repos / festvox / Festival

Festival
🔧🔗https://github.com/festvox/festival

Festival Speech Synthesis System

festival voice voice-synthesis speech-synth... speech festvox

Updated Sep 02, 2025

mirrored_repos / festvox / Speech Tools

Edinburgh Speech Tools
🔧🔗https://github.com/festvox/speech_tools CMU Edinburgh Speech Tools

voice voice-synthesis speech speech-synth... speech-toolkit festival festvox

Updated Sep 02, 2025

mirrored_repos / MachineLearning / xinjli / Ucla Phonetic Corpus

UCLA Phonetic Corpus
🔧🔗https://github.com/xinjli/ucla-phonetic-corpus

Dataset for the ICASSP 2021 paper "Multilingual Phonetic Dataset for Low Resource Speech Recognition"

speech-recog... speech dataset phonetics

Updated Sep 02, 2025

mirrored_repos / MachineLearning / xinjli / Alqalign

alqalign
🔧🔗https://github.com/xinjli/alqalign multilingual speech aligner

speech alignment speech-toolkit toolkit phoneme

Updated Sep 02, 2025

mirrored_repos / MachineLearning / xinjli / Allosaurus

Allosaurus
🔧🔗https://github.com/xinjli/allosaurus Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

speech pytorch speech-recog... phonetics

Updated Sep 02, 2025

mirrored_repos / MachineLearning / speechbrain / Speechbrain.Github.Io

🔧🔗https://github.com/speechbrain/speechbrain.github.io The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Deep Learning neural-network speech speech-recog... speech-to-text speaker-reco... speaker-veri... speech-proce... speech-recog... beamforming speech-analysis speech-recog... speech-separ... speech-emoti... speaker-iden... Documentation website speechbrain

Updated Jun 18, 2025

mirrored_repos / MachineLearning / elizaOS / LJSpeechTools

🔧🔗https://github.com/elizaOS/LJSpeechTools

Tools for making LJSpeech datasets

speech Python elizaOS

Updated Feb 13, 2025

mirrored_repos / MachineLearning / homebrewltd / AudioBench

https://github.com/homebrewltd/AudioBench AudioBench: A Universal Benchmark for Audio Large Language Models 🔗 https://arxiv.org/abs/2406.16020

speech speech-recog... speech-quest... audio-scene-...

Updated Jan 16, 2025

mirrored_repos / MachineLearning / Suno-ai / Bark

🔧🔗https://github.com/suno-ai/bark Bark is a transformer-based text-to-audio model created by Suno

text-to-audio Machine Lear... generative-ai audio speech

Updated Oct 07, 2024

mirrored_repos / MachineLearning / Camb-ai / MARS5 TTS

https://github.com/Camb-ai/MARS5-TTS MARS5 speech model (TTS) from CAMB.AI www.camb.ai

speech tts speech-synth... prosody voice-cloning voice-cloneai

Updated Aug 01, 2024
