Projects with this topic
-
🔧 🔗 https://github.com/vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Updated -
🔧 🔗 https://github.com/langfuse/langfuse🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LUpdated -
Crush
🔧 🔗 https://github.com/charmbracelet/crush The glamourous AI coding agent for your favourite terminal💘 Updated -
🔧 🔗 https://github.com/modelscope/ms-swiftSWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs.
Updated -
🔧 🔗 https://github.com/flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Updated -
🔧 🔗 https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on AscendUpdated -
🔧 🔗 https://github.com/getzep/graphitiBuild and query dynamic, temporally-aware Knowledge Graphs
Updated -
Dyad
🔧 🔗 https://github.com/dyad-sh/dyad Free, local, open-source AI app builder✨ v0 / lovable / Bolt alternative🌟 Star if you like it!Updated -
🔧 🔗 https://github.com/All-Hands-AI/OpenHands🙌 OpenHands: Code Less, Make MoreUpdated -
🔧 🔗 https://github.com/containers/ramalamaRamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all
Updated -
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure. burr.dagworks.io
Updated -
🔧 🔗 https://github.com/google/dotprompt Executable GenAI prompt templatesUpdated -
🔧 🔗 https://github.com/gpustack/gpustackManage GPU clusters for running LLMs
Updated -
rust sdk
https://github.com/agentclientprotocol/rust-sdk
Rust SDK for ACP clients and agents.
Updated -
🔧 🔗 https://github.com/google/ml-metrics Ml-metrics provides performant and distributed friendly ML metrics implementations.Updated -
huggingface.co/transformers https://github.com/huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.Updated -
🔧 🔗 https://github.com/google/adk-python An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.Updated -
🔧 🔗 https://github.com/EleutherAI/lm-evaluation-harness A framework for few-shot evaluation of language models.Updated -
OpenLIT is an open-source LLM Observability tool built on OpenTelemetry.
📈 🔥 Monitor GPU performance, LLM traces with input and output metadata, and metrics like cost, tokens, and user interactions along with complete APM for LLM Apps.🖥 ️Updated -
https://github.com/InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
🔗 lmdeploy.readthedocs.io/en/latest/Updated