Projects with this topic
-
-
-
🔧 🔗 https://github.com/modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processingUpdated -
🔧 🔗 https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.Updated -
https://github.com/homebrewltd/AudioBench AudioBench: A Universal Benchmark for Audio Large Language Models
🔗 https://arxiv.org/abs/2406.16020Updated -
🔧 🔗 https://github.com/markovka17/digit-recognition A small model for recognition of digits in audio clipsUpdated -
https://github.com/YuanGongND/ltu Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
Updated -
https://github.com/coqui-ai/STT
🐸 STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.Updated