Llama

Projects with this topic


mirrored_repos / MachineLearning / vllm-project / Vllm

🔧🔗https://github.com/vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

vllm amd cuda inference pytorch transformer Llama gpt rocm model-serving tpu hpu mlops xpu inferentia Large Langua... llm-inference llmops

Updated Apr 30, 2026

mirrored_repos / MachineLearning / InternLM / Lmdeploy

https://github.com/InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

🔗 lmdeploy.readthedocs.io/en/latest/

Llama cuda-kernels deepspeed Large Langua... fastertransf... llm-inference turbomind internlm llama2 codellama llama3

Updated Apr 29, 2026

