Projects with this topic
https://github.com/IAHispano/Applio A simple, high-quality voice conversion tool focused on ease of use and performance
ebook to AudioBook
https://github.com/DrewThomasson/ebook2audiobook Generate audiobooks from e-books, voice cloning & 1107+ languages!
https://github.com/fishaudio/fish-speech Brand new TTS solution
https://github.com/fishaudio/Bert-VITS2 vits2 backbone with multilingual-bert
https://github.com/alphacep/vosk-tts Text To Speech Synthesis with Vosk
ASR2k
https://github.com/xinjli/asr2k ASR2K: Speech Recognition for Around 2000 Languages without Audio
https://github.com/yandexdataschool/speech_course YSDA course in Speech Processing.
https://github.com/gpustack/vox-box A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
https://github.com/homebrewltd/WhisperSpeech An Open Source text-to-speech system built by inverting Whisper.
https://collabora.github.io/WhisperSpeech/
https://github.com/modelscope/FunCodec FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
https://github.com/coqui-ai/TTS
🐸 💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
https://github.com/coqui-ai/TTS-recipes
🐸 TTS recipes for different datasets
AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models https://github.com/noco-ai/spellbook-docker/wiki