G
gguf

Projects with this topic

View Llama.Cpp project

mirrored_repos / MachineLearning / ggeranov / Llama.Cpp

https://github.com/ggml-org/llama.cpp LLM inference in C/C++

ggml gguf cpp Llama

0

Updated Apr 30, 2026

0 0 0 0

Updated Apr 30, 2026
View Gguf Parser Go project

mirrored_repos / MachineLearning / GPUStack / Gguf Parser Go

🔧🔗https://github.com/gpustack/gguf-parser-go

Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

GoLang gguf telemetry GPUStack go

0

Updated Mar 25, 2026

0 0 0 0

Updated Mar 25, 2026
View Nitro project

mirrored_repos / MachineLearning / menloresearch / Nitro

https://github.com/janhq/nitro.git now: https://github.com/janhq/cortex.git Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan

Synthetic In... cuda Llama accellerated inference-en... openai-api Large Langua... stable-diffu... llamacpp llama2 llama3 gguf tensorrt-llm

0

Updated Jul 04, 2025

0 0 0 0

Updated Jul 04, 2025
View Llama Box project

mirrored_repos / MachineLearning / GPUStack / Llama Box

🔧🔗https://github.com/gpustack/llama-box

LM inference server implementation based on llama.cpp.

cpp transformer diffusion gguf openai-api GPUStack

0

Updated Apr 22, 2025

0 0 0 0

Updated Apr 22, 2025
View Gguf Packer Go project

mirrored_repos / MachineLearning / GPUStack / Gguf Packer Go

🔧🔗https://github.com/gpustack/gguf-packer-go

Deliver LLMs of GGUF format via Dockerfile.

gguf packer GoLang go Large Langua... GPUStack

0

Updated Nov 30, 2024

0 0 0 0

Updated Nov 30, 2024

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾