🔧🔗https://github.com/google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.