L
Large Language Models

Projects with this topic

View Ollam project

mirrored_repos / MachineLearning / ollama / Ollam

Ollama
https://github.com/ollama/ollama Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

go GoLang Llama glm minimax gemma mistral Large Langua... llms ollama qwen deepseek gpt-oss Synthetic In...

0

Updated Jun 10, 2026

0 0 0 0

Updated Jun 10, 2026
View Vllm Ascend project

mirrored_repos / MachineLearning / vllm-project / Vllm Ascend

🔧🔗https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on Ascend

inference transformer model-serving mlops ascend Large Langua... llmops llm-serving vllm

0

Updated Jun 10, 2026

0 0 0 0

Updated Jun 10, 2026
View Llm.C project

mirrored_repos / MachineLearning / karpathy / Llm.C

LLM.c
LLM training in simple, raw C/CUDA LLMs in simple, pure C/CUDA with no need for 245MB of PyTorch or 107MB of cPython. Current focus is on pretraining, in particular reproducing the GPT-2 and GPT-3 miniseries, along with a parallel PyTorch ref

Large Langua... llm-training cuda C

0

Updated Nov 01, 2025

0 0 0 0

Updated Nov 01, 2025
View Dash Infer project

mirrored_repos / MachineLearning / modelscope / Dash Infer

🔧🔗https://github.com/modelscope/dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

modelscope cpu llm-inference Large Langua... native-engine

0

Updated Jul 29, 2025

0 0 0 0

Updated Jul 29, 2025
View Nitro project

mirrored_repos / MachineLearning / menloresearch / Nitro

https://github.com/janhq/nitro.git now: https://github.com/janhq/cortex.git Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan

Synthetic In... cuda Llama accellerated inference-en... openai-api Large Langua... stable-diffu... llamacpp llama2 llama3 gguf tensorrt-llm

0

Updated Jul 04, 2025

0 0 0 0

Updated Jul 04, 2025
View Cortex.Tensorrt Llm project

mirrored_repos / MachineLearning / menloresearch / Cortex.Tensorrt Llm

https://github.com/janhq/cortex.tensorrt-llm Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.

nvidia jan tensorrt Large Langua... tensorrt-llm

0

Updated Jan 20, 2025

0 0 0 0

Updated Jan 20, 2025
View Qwen.Cpp project

mirrored_repos / MachineLearning / QwenLM / Qwen.Cpp

🔧🔗https://github.com/QwenLM/qwen.cpp C++ implementation of Qwen-LM

C qwen qwen2 cpp Large Langua...

0

Updated Dec 16, 2024

0 0 0 0

Updated Dec 16, 2024
View APAR project

mirrored_repos / MachineLearning / THUDM / APAR

https://github.com/THUDM/APAR APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding

Large Langua... parallel auto-regressive

0

Updated Sep 30, 2024

0 0 0 0

Updated Sep 30, 2024