tensorrt
Projects with this topic
- https://github.com/roboflow/inference: A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
- https://github.com/janhq/cortex: Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan.
- https://github.com/janhq/cortex.tensorrt-llm: Cortex.TensorRT-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA's TensorRT-LLM for GPU-accelerated inference on NVIDIA GPUs.
- Real-time inference for Stable Diffusion with 0.88s latency. Covers AITemplate, nvFuser, TensorRT, and FlashAttention. Join our Discord community: https://discord.com/invite/TgHXuSJEk6
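
The common workflow behind these projects is compiling a trained model (typically exported to ONNX) into an optimized TensorRT engine that a server can load at runtime. A minimal sketch of that step using TensorRT's Python API, assuming TensorRT 8.x/9.x bindings and a hypothetical "model.onnx" file; this is illustrative, not the code of any project listed above:

```python
# Minimal sketch: compile an ONNX model into a serialized TensorRT engine.
# Assumes TensorRT 8.x/9.x Python bindings; "model.onnx" is a hypothetical path.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)

# ONNX models require an explicit-batch network definition.
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("failed to parse ONNX model")

config = builder.create_builder_config()
config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, 1 << 30)  # 1 GiB
if builder.platform_has_fast_fp16:
    config.set_flag(trt.BuilderFlag.FP16)  # use FP16 kernels where supported

# Serialize the optimized engine so a server can load it at runtime.
engine_bytes = builder.build_serialized_network(network, config)
with open("model.plan", "wb") as f:
    f.write(engine_bytes)
```

The resulting .plan file is what inference servers like the ones above deserialize and execute on an NVIDIA GPU.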