Explore projects
A high-throughput and memory-efficient inference and serving engine for LLMs
cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server at runtime.
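Since the key design point above is a dynamic library loaded by a host server at runtime, here is a minimal Python ctypes sketch of that pattern; the library path and the initialize symbol are hypothetical illustrations, not cortex.llamacpp's actual ABI.

```python
import ctypes

# Hypothetical library name and exported symbol, for illustration only;
# the real exported interface is defined by cortex.llamacpp itself.
engine = ctypes.CDLL("./libengine.so")   # load the engine at runtime
engine.initialize.restype = ctypes.c_int
status = engine.initialize()
print("engine init status:", status)
```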
MLX: An array framework for Apple silicon
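As a quick taste of the array API, a minimal sketch using MLX's Python package (mlx.core); MLX builds computations lazily, and mx.eval forces them to run.

```python
import mlx.core as mx

a = mx.array([1.0, 2.0, 3.0])
b = mx.exp(a) + a   # builds a lazy computation graph
mx.eval(b)          # forces evaluation (GPU by default on Apple silicon)
print(b)
```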
Swift API for MLX
gpu.cpp (https://github.com/AnswerDotAI/gpu.cpp)
A lightweight library for portable low-level GPU computation using WebGPU.
FFmpeg Builds for yt-dlp (forked from BtbN/FFmpeg-Builds)
cortex (https://github.com/janhq/cortex)
Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan.
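Since Cortex positions itself as a drop-in OpenAI alternative, a chat request against its OpenAI-compatible endpoint might look like the sketch below; the port and model id are assumptions for illustration, so check the Cortex docs for the actual defaults.

```python
import requests

# Assumed local endpoint and model id, not verified defaults.
resp = requests.post(
    "http://localhost:39281/v1/chat/completions",
    json={
        "model": "llama3.1:8b",
        "messages": [{"role": "user", "content": "Hello from a local stack"}],
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```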
cortex.tensorrt-llm (https://github.com/janhq/cortex.tensorrt-llm)
Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a submodule for GPU-accelerated inference on NVIDIA GPUs.
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
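A minimal MicroPython sketch of the classic blink loop, assuming a board whose onboard LED sits on pin 25 (e.g. a Raspberry Pi Pico); pin numbers and Pin.toggle() availability are port-specific.

```python
from machine import Pin
import time

led = Pin(25, Pin.OUT)   # pin 25 assumed; board-specific
while True:
    led.toggle()         # available on the rp2 port; use led.value(...) elsewhere
    time.sleep(0.5)
```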
Code for QuaRot, end-to-end 4-bit inference for large language models (arxiv.org/abs/2404.00456).
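For intuition only, a generic symmetric 4-bit quantize/dequantize sketch in NumPy; QuaRot's actual contribution is rotating weights and activations to suppress outliers before quantizing (see the paper), which this toy example does not implement.

```python
import numpy as np

def quantize_int4(x):
    # Symmetric int4: representable range is [-8, 7].
    scale = np.abs(x).max() / 7.0
    q = np.clip(np.round(x / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

x = np.random.randn(8).astype(np.float32)
q, s = quantize_int4(x)
print(np.abs(x - dequantize(q, s)).max())   # worst-case quantization error
```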
User-Mode Driver for Tenstorrent hardware
web-utils (https://github.com/Picovoice/web-utils)
Web utility functions for Picovoice web bindings and SDKs.
inference-engine (https://github.com/coqui-ai/inference-engine)
Coqui Inference Engine