llm-inference

Projects with this topic

mirrored_repos / MachineLearning / InternLM / Lmdeploy

https://github.com/InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs. 🔗 lmdeploy.readthedocs.io/en/latest/

Llama cuda-kernels deepspeed
+ 8 more

0

Updated Dec 22, 2024

0 0 0 0

Updated Dec 22, 2024
mirrored_repos / MachineLearning / modelscope / Dash Infer

🔧🔗https://github.com/modelscope/dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.

modelscope cpu llm-inference
+ 2 more

0

Updated Dec 12, 2024

0 0 0 0

Updated Dec 12, 2024
mirrored_repos / MachineLearning / huggingface / Parler Tts

https://github.com/huggingface/parler-tts Inference and training library for high-quality TTS models.

parler text-to-speech llm-inference

0

Updated Dec 10, 2024

0 0 0 0

Updated Dec 10, 2024
mirrored_repos / MachineLearning / Lightning-AI / Litgpt

https://github.com/Lightning-AI/litgpt 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. 🔗 https://lightning.ai/

Deep Learning Synthetic In... large-langua...
+ 2 more

0

Updated Nov 28, 2024

0 0 0 0

Updated Nov 28, 2024
mirrored_repos / MachineLearning / bytedance / ShadowKV

🔧🔗https://github.com/bytedance/ShadowKV ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

research low-rank sparse-atten...
+ 2 more

0

Updated Nov 27, 2024

0 0 0 0

Updated Nov 27, 2024
mirrored_repos / MachineLearning / mistralai / Mistral Inference

https://github.com/mistralai/mistral-inference Official inference library for Mistral models

mistral.ai/

Large Langua... llm-inference mistral
+ 2 more

0

Updated Nov 12, 2024

0 0 0 0

Updated Nov 12, 2024
mirrored_repos / MachineLearning / arcee-ai / Arcee Python

The Arcee client for executing domain-adpated language model routines

Synthetic In... Large Langua... llmops
+ 2 more

0

Updated Oct 08, 2024

0 0 0 0

Updated Oct 08, 2024
mirrored_repos / MachineLearning / coqui-ai / Inference Engine

https://github.com/coqui-ai/inference-engine Coqui Inference Engine

llm-inference

0

Updated Jun 09, 2024

0 0 0 0

Updated Jun 09, 2024
mirrored_repos / MachineLearning / noco-ai / Spellbook Docker

AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models https://github.com/noco-ai/spellbook-docker/wiki

text-to-speech bark automatic-sp...
+ 7 more

0

Updated May 01, 2024

0 0 0 0

Updated May 01, 2024