Projects with this topic
https://github.com/mlc-ai/web-llm High-performance In-browser LLM Inference Engine
🔧 🔗 https://github.com/flashinfer-ai/flashinfer FlashInfer: Kernel Library for LLM Serving
🔧 🔗 https://github.com/google/dotprompt Executable GenAI prompt templates
🔧 🔗 https://github.com/sgl-project/sglang SGLang is a fast serving framework for large language models and vision language models.
🔧 🔗 https://github.com/tensorzero/tensorzero TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
BAML
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
🔧 🔗 https://github.com/microsoft/intelligence-toolkit Interactive workflows for creating AI intelligence reports from real-world data sources
🔧 🔗 https://github.com/vllm-project/vllm A high-throughput and memory-efficient inference and serving engine for LLMs
🔧 🔗 https://github.com/BerriAI/litellm Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
🕸️ 🔗 docs.litellm.ai/docs/
Catwalk
🔧 🔗 https://github.com/charmbracelet/catwalk 🐈 A collection of LLM inference providers and models
OpenLIT is an open-source LLM Observability tool built on OpenTelemetry.
📈 🔥 Monitor GPU performance, LLM traces with input and output metadata, and metrics like cost, tokens, and user interactions, along with complete APM for LLM Apps. 🖥️
🔧 🔗 https://github.com/openai/codex Lightweight coding agent that runs in your terminal
🔧 🔗 https://github.com/getzep/graphiti Build and query dynamic, temporally-aware Knowledge Graphs
🔧 🔗 https://github.com/Skyvern-AI/skyvern Automate browser-based workflows with LLMs and Computer Vision
🕸️ 🔗 www.skyvern.com/
🔧 🔗 https://github.com/getzep/zep Zep | The Memory Foundation For Your AI Stack
https://github.com/joaomdmoura/crewAI Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
crewai.com
LLM.c
LLM training in simple, raw C/CUDA. LLMs in simple, pure C/CUDA with no need for 245MB of PyTorch or 107MB of cPython. Current focus is on pretraining, in particular reproducing the GPT-2 and GPT-3 miniseries, along with a parallel PyTorch ref
🔧 🔗 https://github.com/open-webui/open-webui User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
LLMs from scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔧 🔗 https://github.com/rasbt/LLMs-from-scratch