Projects with this topic
🔧 🔗 https://github.com/containers/ramalama RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all
🔧 🔗 https://github.com/llmware-ai/llmware Unified framework for building enterprise RAG pipelines with small, specialized models
llama server
🔧 🔗 https://github.com/maragudk/llama-server A simple layer on top of ghcr.io/ggml-org/llama.cpp:server to load GGUF models from assets.maragu.dev at runtime.