Projects with this topic
🔧 🔗 https://github.com/pytorch/serve
Serve, optimize and scale PyTorch models in production.
🔧 🔗 https://github.com/modelscope/dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.