tensorrt-llm
Projects with this topic
- https://github.com/janhq/cortex.tensorrt-llm Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a git submodule for GPU-accelerated inference on NVIDIA GPUs.