Projects with this topic
-
huggingface.co/transformers https://github.com/huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.Updated -
🔧 🔗 https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on AscendUpdated -
🔧 🔗 https://github.com/modelscope/ms-swiftSWIFT (Scalable lightWeight Infrastructure for Fine-Tuning) Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs.
Updated -
🔧 🔗 https://github.com/getzep/graphitiBuild and query dynamic, temporally-aware Knowledge Graphs
Updated -
🔧 🔗 https://github.com/mem0ai/mem0 Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.Updated -
🔧 🔗 https://github.com/modelscope/data-juicer Making data higher-quality, juicier, and more digestible for foundation models!🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷 为大模型提供更高质量、更丰富、更易”消化“的数据!Updated -
🔧 🔗 https://github.com/flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Updated -
SGPT
🔧 🔗 https://github.com/tbckr/sgptSGPT is a command-line tool that provides a convenient way to interact with OpenAI models, enabling users to run queries, generate shell commands and produce code directly from the terminal.
Updated -
🔧 🔗 https://github.com/containers/ramalamaRamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all
Updated -
https://github.com/joaomdmoura/crewAI Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
crewai.com
Updated -
🔧 🔗 https://github.com/vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Updated -
https://github.com/princeton-nlp/SWE-agent SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Updated -
🔧 🔗 https://github.com/gpustack/gpustackManage GPU clusters for running LLMs
Updated -
🔧 🔗 https://github.com/langfuse/langfuse🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LUpdated -
OpenLIT is an open-source LLM Observability tool built on OpenTelemetry.
📈 🔥 Monitor GPU performance, LLM traces with input and output metadata, and metrics like cost, tokens, and user interactions along with complete APM for LLM Apps.🖥 ️Updated -
🔧 🔗 https://github.com/modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.Updated -
🔧 🔗 https://github.com/HKUDS/LightRAG"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Updated -
https://github.com/InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
🔗 lmdeploy.readthedocs.io/en/latest/Updated -
🔧 🔗 https://github.com/Cinnamon/kotaemonAn open-source RAG-based tool for chatting with your documents.
🕸 ️🔗 https://cinnamon.github.io/kotaemon/Updated -
🔧 🔗 https://github.com/HKUDS/AI-Creator"AI-Creator: Multi-Modal Agents for Video Production"
Updated