Projects with this topic
AI Podcast Generator
https://github.com/nbursa/ai-podcast-generator An end‑to‑end podcast generation pipeline that turns pre‑structured learning materials (PDF/JSON “content pillar”, notes, etc…
https://github.com/yandexdataschool/speech_course YSDA course in Speech Processing.
https://github.com/gpustack/vox-box A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
https://github.com/homebrewltd/WhisperSpeech An Open Source text-to-speech system built by inverting Whisper.
https://collabora.github.io/WhisperSpeech/
https://github.com/lucidrains/e2-tts-pytorch Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
https://github.com/astramind-ai/Auralis A Fast TTS Engine
https://github.com/huggingface/parler-tts Inference and training library for high-quality TTS models.
https://github.com/modelscope/FunCodec FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
https://github.com/neonbjb/tortoise-tts.git A multi-voice TTS system trained with an emphasis on quality
https://github.com/lucidrains/voicebox-pytorch Implementation of Voicebox, new SOTA text-to-speech network from MetaAI, in PyTorch
https://github.com/Camb-ai/MARS5-TTS MARS5 speech model (TTS) from CAMB.AI www.camb.ai
https://github.com/coqui-ai/TTS 🐸 💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
https://github.com/coqui-ai/TTS-papers 🐸 collection of TTS papers
https://github.com/coqui-ai/TTS-recipes 🐸 TTS recipes for different datasets
https://github.com/coqui-ai/open-speech-corpora 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
https://github.com/noco-ai/spellbook-docker/wiki AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS, and many other AI models