Projects with this topic
Sort by:
-
🔧 🔗 https://github.com/andrewkchan/yalmYet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Updated
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O