L
llm-serving

  • Any
  • C
  • C#
  • C++
  • CMake
  • CSS
  • Dockerfile
  • Go
  • HCL
  • HTML
  • Java
  • JavaScript
  • Jinja
  • Jupyter Notebook
  • MDX
  • Makefile
  • PHP
  • Python
  • Ruby
  • Rust
  • SCSS
  • Shell
  • Swift
  • TSX
  • TypeScript
  • Vue

Projects with this topic

Sort by:
  • Sort by
  • Updated date
  • Name
  • Name, descending
  • Oldest updated
  • Oldest created
  • Last created
  • Most stars
  • Hide archived projects
  • Show archived projects
  • Show archived projects only
  • View Vllm Ascend project

    mirrored_repos / MachineLearning / vllm-project / Vllm Ascend

    🔧🔗https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on Ascend

    inference transformer model-serving mlops ascend Large Langua... llmops llm-serving vllm
    0
    Updated Dec 13, 2025
    0 0 0 0
    Updated Dec 13, 2025
  • View Sglang project

    mirrored_repos / MachineLearning / sgl-project / Sglang

    🔧🔗https://github.com/sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    cuda inference pytorch transformer moe Llama vlm Large Langua... llm-serving llava deepseek llama3
    0
    Updated Dec 01, 2025
    0 0 0 0
    Updated Dec 01, 2025

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾