Skip to content
Snippets Groups Projects
Commit be79a2d4 authored by Sanyam Bhutani's avatar Sanyam Bhutani
Browse files

One more fix

parent 95db0fe4
No related branches found
No related tags found
No related merge requests found
......@@ -4,7 +4,7 @@
This is a guided series of tutorials/notebooks that can be taken as a reference or course to build a PDF to Podcast workflow.
You will also learn from experimentations of using Text to Speech Models.
You will also learn from the experiments of using Text to Speech Models.
It assumes zero knowledge of LLMs, prompting and audio models, everything is covered in their respective notebooks.
......
......@@ -97,7 +97,7 @@ Audio(speech_output[0].cpu().numpy(), rate=sampling_rate)```
Promising directions to explore in future:
- [MeloTTS](huggingface.co/myshell-ai/MeloTTS-English) This is most popular (ever) on HuggingFace
- [MeloTTS](https://huggingface.co/myshell-ai/MeloTTS-English) This is most popular (ever) on HuggingFace
- [WhisperSpeech](https://huggingface.co/WhisperSpeech/WhisperSpeech) sounded quite natural as well
- [F5-TTS](https://github.com/SWivid/F5-TTS) was the latest release at this time, however, it felt a bit robotic
- E2-TTS: r/locallama claims this to be a little better, however, it didn't pass the vibe test
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment