L
llm-inference

  • Any
  • C
  • C#
  • C++
  • CMake
  • CSS
  • Dockerfile
  • Go
  • HCL
  • HTML
  • Java
  • JavaScript
  • Jinja
  • Jupyter Notebook
  • MDX
  • Makefile
  • PHP
  • Python
  • Ruby
  • Rust
  • SCSS
  • Shell
  • Swift
  • TSX
  • TypeScript
  • Vue

Projects with this topic

Sort by:
  • Sort by
  • Updated date
  • Name
  • Name, descending
  • Oldest updated
  • Oldest created
  • Last created
  • Most stars
  • Hide archived projects
  • Show archived projects
  • Show archived projects only
  • View Lmdeploy project

    mirrored_repos / MachineLearning / InternLM / Lmdeploy

    https://github.com/InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs. 🔗 lmdeploy.readthedocs.io/en/latest/

    Llama cuda-kernels deepspeed Large Langua... fastertransf... llm-inference turbomind internlm llama2 codellama llama3
    0
    Updated Dec 12, 2025
    0 0 0 0
    Updated Dec 12, 2025
  • View Dash Infer project

    mirrored_repos / MachineLearning / modelscope / Dash Infer

    🔧🔗https://github.com/modelscope/dash-infer

    DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

    modelscope cpu llm-inference Large Langua... native-engine
    0
    Updated Jul 29, 2025
    0 0 0 0
    Updated Jul 29, 2025
  • View Inference Engine project

    mirrored_repos / MachineLearning / coqui-ai / Inference Engine

    https://github.com/coqui-ai/inference-engine Coqui Inference Engine

    llm-inference
    0
    Updated Jun 09, 2024
    0 0 0 0
    Updated Jun 09, 2024

🐾❤️ Strive to be the person your dogs believe you are ❤️🐾