Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/ggerganov/ggml
. Pull mirroring updated
Sep 19, 2024
.
llama
7f32376b
·
llama : initial working FP16 + 4-bit Q4_0
·
Mar 10, 2023
4bit
baeb88b8
·
tests : add 4-bit Clover-based quantization
·
Feb 21, 2023
t5
1d38a69d
·
t5 : initial load in ggml
·
Jan 02, 2023
experiments/blocking
3afb833f
·
wip : unsuccessful attempts speeding mul_mat using blocking
·
Oct 13, 2022
Prev
1
2
Next