https://github.com/gpustack
🔧🔗https://github.com/gpustack/llama-box
LM inference server implementation based on llama.cpp.