Projects with this topic
-
-
https://github.com/Picovoice/web-voice-processor A library for real-time voice processing in web browsers.
Updated -
🔧 🔗 https://github.com/bytedance/SALMONN SALMONN: Speech Audio Language Music Open Neural NetworkUpdated -
Sox Static Binaries
🔧 🔗 https://github.com/derhuerst/sox-static Static sox binaries for MacOS, Linux and Windows. SoX is the Swiss Army Knife of sound processing utilities. It can convert audio files to other popular audio file types and also apply sound effects and filters during the conversion. https://sourceforge.net/projects/sox/Updated -
🔧 🔗 https://github.com/gpustack/vox-boxA text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
Updated -
https://github.com/YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Updated