GPUStack
Projects with this topic
-
-
-
🔧 🔗 https://github.com/gpustack/gpustack-uiUI for GPUStack
Updated -
🔧 🔗 https://github.com/gpustack/gguf-parser-goReview/Check GGUF files and estimate the memory usage and maximum tokens per second.
Updated -
🔧 🔗 https://github.com/gpustack/llama-boxLM inference server implementation based on llama.cpp.
Updated -
🔧 🔗 https://github.com/gpustack/gguf-packer-goDeliver LLMs of GGUF format via Dockerfile.
Updated