Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/ggerganov/ggml
. Pull mirroring updated
Sep 19, 2024
.
experiments/blocking
3afb833f
·
wip : unsuccessful attempts speeding mul_mat using blocking
·
Oct 13, 2022
t5
1d38a69d
·
t5 : initial load in ggml
·
Jan 02, 2023
4bit
baeb88b8
·
tests : add 4-bit Clover-based quantization
·
Feb 21, 2023
llama
7f32376b
·
llama : initial working FP16 + 4-bit Q4_0
·
Mar 10, 2023
cerebras-wip
23cc0e44
·
wip wip wip
·
Mar 29, 2023
gq
724c45d5
·
ggml : finalize the Q4_1 quantization for ARM_NEON
·
Mar 29, 2023
fix-mul-mat
c5a21cd2
·
ggml : fix mul_mat src1 indexing when src1 is not contiguous
·
Jul 14, 2023
ci-test
c5d62d8c
·
ci : test push into a different branch
·
Jul 16, 2023
ggml-backend-metal
4f8b5804
·
gpt-2 : remove TODO + update comment
·
Oct 06, 2023
feature/parallel-decoding-gpt2-example
2024e42a
·
Remove not needed exit check
·
Oct 12, 2023
ggml-cpp
a328beae
·
avoid specifiying the namespace in function calls (ADL)
·
Oct 15, 2023
gpt-2-opt
1343784d
·
cuda : op skip
·
Oct 21, 2023
mul-mat-id-batch
e715793e
·
only run GGML_TASK_INIT and GGML_TASK_FINALIZE once
·
Dec 09, 2023
gg/cuda-assert-mul-mat-pad
c1104127
·
cuda : ggml_mul_mat assert for padded src1
·
Dec 29, 2023
test
46e6c0fd
·
test 5
·
Jan 18, 2024
release
03c1ad89
·
sync : whisper.cpp
·
Feb 12, 2024
gg/gguf-spec-diagram
992ac7aa
·
spec : add GGUF diagram
·
Mar 15, 2024
gg/yarn-tests
ef5d14de
·
ggml : add rope tests + fix neox
·
May 29, 2024
gg/remove-whisper-example
0059b30e
·
examples : remove whisper
·
Jun 16, 2024