Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/ggerganov/llama.cpp
. Pull mirroring updated
Sep 19, 2024
.
ik/context_extend
333c40b9
·
Fixed typo
·
Jun 27, 2023
try-fix-metal
5cc672a9
·
metal : try to utilize more of the shared memory using smaller views
·
Jun 26, 2023
avoid-gnu-source
78fafcaf
·
ggml : do not use _GNU_SOURCE gratuitously
·
Jun 25, 2023
ik/q4_k_fast
bd49a86a
·
Q4_K_F: 2nd shot at Metal
·
Jun 17, 2023
ik/k_quants
9c8536e5
·
Had unintentionally committed the Makefile with -Ofast enabled
·
Jun 05, 2023
fix_clblast
20054a38
·
Fix directory name
·
May 27, 2023
cmake-change
75649f44
·
Change CMake files
·
May 24, 2023
fix-benchmark-matmult
b16c085c
·
examples : fix benchmark-matmult
·
May 21, 2023
chunks
a1cdd29c
·
ggml : rms_norm in chunks
·
May 20, 2023
steering
95dc4d72
·
Merge 'origin/master' into steering
·
May 19, 2023
f16c
40ec4882
·
ggml : use F16C conversion when available
·
May 17, 2023
dequantize-matmul-3-gg
a3e6d622
·
cuda : alternative q4_q8 kernel
·
May 12, 2023
remove-vzip
e116eb63
·
ggml : speed-up Q5_0 + Q5_1 at 4 threads
·
May 11, 2023
jed/spm-clblast
4baa8563
·
Fix build
·
May 06, 2023
ci_cublas
31ff9e2e
·
ci : add cublas to windows release
·
May 03, 2023
q4_3-range-fix
102cd980
·
ggml : Q4_3c using 2x "Full range" approach
·
Apr 23, 2023
q4_0-q4_2-range-fix
71e6ae37
·
ggml : continue from #729 (wip)
·
Apr 22, 2023
gg/rmse_quantization
a0242a83
·
Minor, plus rebase on master
·
Apr 22, 2023
quant-attn
4b8d5e38
·
llama : quantize attention results
·
Apr 22, 2023
mmap-pages-stats
15067374
·
Add mmap pages stats (disabled by default)
·
Apr 16, 2023
Prev
1
…
21
22
23
24
25
26
Next