Projects with this topic
Sort by:
-
rustbpe
https://github.com/karpathy/rustbpe
The missing tiktoken training code
Updated -
Transphone
🔧 🔗 https://github.com/xinjli/transphone phoneme tokenizer and grapheme-to-phoneme model for 8k languagesUpdated