Projects with this topic
https://github.com/InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
lmdeploy.readthedocs.io/en/latest/
https://github.com/vllm-project/vllm-ascend Community-maintained hardware plugin for vLLM on Ascend.
https://github.com/modelscope/dash-infer DashInfer is a native LLM inference engine aiming to deliver industry-leading performance across a range of hardware architectures, including x86 and ARMv9.
https://github.com/janhq/cortex.tensorrt-llm Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a submodule for GPU-accelerated inference on NVIDIA GPUs.
https://github.com/QwenLM/qwen.cpp C++ implementation of Qwen-LM.
https://github.com/LibreTranslate/RemoveDup Remove duplicates from parallel corpora.