Projects with this topic
- https://github.com/vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs (see the offline-inference sketch after this list)
- https://github.com/vllm-project/llm-compressor
  Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- https://github.com/vllm-project/vllm-ascend
  Community-maintained hardware plugin for vLLM on Ascend
- https://github.com/containers/ramalama
  RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production.
- https://github.com/modelscope/ms-swift
  SWIFT (Scalable lightWeight Infrastructure for Fine-Tuning): use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs.
- https://github.com/vllm-project/aibrix
  Cost-efficient and pluggable infrastructure components for GenAI inference
- https://github.com/vllm-project/vllm-spyre
  Community-maintained hardware plugin for vLLM on Spyre
- https://github.com/vllm-project/production-stack
  Scale from a single vLLM instance to a distributed vLLM deployment without changing any application code.
- Neuromancer (https://git.tomfos.tr/tom)
  Self-hosted, GPU-optimised GenAI platform providing a drop-in OpenAI-compatible API (see the client sketch after this list)
- https://github.com/vllm-project/vllm_allocator_adaptor
  An adaptor that allows a Python-level allocator to be used as a PyTorch pluggable allocator
- https://github.com/meta-llama/llama-recipes
  Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups.
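
For context on the vllm-project/vllm entry above, here is a minimal sketch of vLLM's offline-inference Python API. The model ID and prompt are placeholders chosen for illustration, not taken from the listing:

```python
# Minimal vLLM offline-inference sketch.
# Assumes `pip install vllm` and a supported GPU; the model ID is a placeholder.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # any Hugging Face-compatible model ID
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# generate() batches prompts and returns one RequestOutput per prompt.
outputs = llm.generate(["The capital of France is"], params)
for out in outputs:
    print(out.outputs[0].text)
```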
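Several of the listed projects (vLLM's own API server, production-stack, Neuromancer) expose an OpenAI-compatible endpoint, so a standard OpenAI client can talk to them by overriding the base URL. The host, port, API key, and model name below are assumptions for illustration:

```python
# Sketch of calling an OpenAI-compatible endpoint, e.g. one started with
# `vllm serve <model>`; base_url, api_key, and model are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="facebook/opt-125m",  # must match the model the server loaded
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(resp.choices[0].message.content)
```

Because the endpoint is a drop-in replacement, swapping between these backends is a matter of changing `base_url`, with no application-code changes.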