Projects with this topic
llama server
🔧 🔗 https://github.com/maragudk/llama-server A simple layer on top of ghcr.io/ggml-org/llama.cpp:server to load GGUF models from assets.maragu.dev at runtime.
🔧 🔗 https://github.com/llmware-ai/llmware Unified framework for building enterprise RAG pipelines with small, specialized models.