Projects with this topic
Sort by:
-
🔧 🔗 https://github.com/vllm-project/vllm-ascend Community maintained hardware plugin for vLLM on AscendUpdated -
🔧 🔗 https://github.com/vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Updated -
🔧 🔗 https://github.com/sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Updated -
🔧 🔗 https://github.com/gpustack/llama-boxLM inference server implementation based on llama.cpp.
Updated -
https://github.com/THUDM/SwissArmyTransformer SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
Updated