Projects with this topic
-
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure. burr.dagworks.io
Updated -
https://github.com/InternLM/xtuner An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Updated -
🔧 🔗 https://github.com/vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Updated -
🔧 🔗 https://github.com/google/adk-python An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.Updated -
🔧 🔗 https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on AscendUpdated -
https://github.com/InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
🔗 lmdeploy.readthedocs.io/en/latest/Updated -
🔧 🔗 https://github.com/getzep/zepZep | The Memory Foundation For Your AI Stack
Updated -
🔧 🔗 https://github.com/flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Updated -
🔧 🔗 https://github.com/modelscope/ms-swiftSWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs.
Updated -
huggingface.co/transformers https://github.com/huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.Updated -
🔧 🔗 https://github.com/langfuse/langfuse🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LUpdated -
🔧 🔗 https://github.com/llmware-ai/llmwareUnified framework for building enterprise RAG pipelines with small, specialized models
Updated -
🔧 🔗 https://github.com/getzep/graphitiBuild and query dynamic, temporally-aware Knowledge Graphs
Updated -
🔧 🔗 https://github.com/gpustack/gpustackManage GPU clusters for running LLMs
Updated -
🔧 🔗 https://github.com/modelscope/modelscope-agentModelScope-Agent: An agent framework connecting models in ModelScope with the world
Updated -
🔧 🔗 https://github.com/containers/ramalamaRamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all
Updated -
https://github.com/mlc-ai/web-llm High-performance In-browser LLM Inference Engine
Updated -
🔧 🔗 https://github.com/EleutherAI/lm-evaluation-harness A framework for few-shot evaluation of language models.Updated -
🔧 🔗 https://github.com/google/ml-metrics Ml-metrics provides performant and distributed friendly ML metrics implementations.Updated -
🔧 🔗 https://github.com/modelscope/data-juicer Making data higher-quality, juicier, and more digestible for foundation models!🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷 为大模型提供更高质量、更丰富、更易”消化“的数据!Updated