gguf
Projects with this topic
https://github.com/janhq/nitro.git (now https://github.com/janhq/cortex.git) Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan.
🔧 🔗 https://github.com/gpustack/gguf-parser-go Review/check GGUF files and estimate the memory usage and maximum tokens per second.
🔧 🔗 https://github.com/gpustack/llama-box LM inference server implementation based on llama.cpp.
🔧 🔗 https://github.com/gpustack/gguf-packer-go Deliver LLMs in GGUF format via Dockerfile.
https://github.com/janhq/cortex Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan.