Large Language Models
Projects with this topic
-
https://github.com/janhq/cortex Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers
👋 JanUpdated -
🔧 🔗 https://github.com/llmware-ai/llmwareUnified framework for building enterprise RAG pipelines with small, specialized models
[
🕸 🔗 https://llmware-ai.github.io/llmware/](https://llmware-ai.github.io/llmwaUpdated -
https://github.com/mistralai/mistral-inference Official inference library for Mistral models
mistral.ai/
Updated -
🔧 🔗 https://github.com/poloclub/mememoA JavaScript library that brings vector search and RAG to your browser!
Updated -
-
https://github.com/InternLM/xtuner An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Updated -
Domain Adapted Language Modeling Toolkit - E2E RAG
Updated -
https://github.com/topoteretes/PromethAI-Backend Open-source framework that gives you AI Agents that help you navigate decision-making, get personalized goals and execute them
Updated -
🔧 🔗 https://github.com/getzep/zepclizepcli - a command line tool for managing the Zep service.
Updated -
https://github.com/mlc-ai/web-llm-chat Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.
Updated -
https://github.com/janhq/cortex.tensorrt-llm Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.
Updated -
https://github.com/THUDM/LongWriter LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Updated -
https://github.com/InternLM/HuixiangDou HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Updated -
https://github.com/InternLM/InternLM-Math State-of-the-art bilingual open-sourced Math reasoning LLMs.
Updated -
🔧 🔗 https://github.com/FoundationVision/LlamaGenAutoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
🔗 arxiv.org/abs/2406.06525Updated -
🔧 🔗 https://github.com/FoundationVision/Groma[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Updated -
🔧 🔗 https://github.com/AlexCheema/veritasVeritas is a library for verifying inference of large ML models on Mina.
Updated -
https://github.com/THUDM/LongCite LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Updated -
-
🔧 🔗 https://github.com/IST-DASLab/marlin FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.Updated