Branches · mirrored_repos / MachineLearning / Llama.Cpp · GitLab

This project is mirrored from https://github.com/ggerganov/llama.cpp. Pull mirroring updated Sep 19, 2024.

ik/context_extend

333c40b9 · Fixed typo · Jun 27, 2023
try-fix-metal

5cc672a9 · metal : try to utilize more of the shared memory using smaller views · Jun 26, 2023
avoid-gnu-source

78fafcaf · ggml : do not use _GNU_SOURCE gratuitously · Jun 25, 2023
ik/q4_k_fast

bd49a86a · Q4_K_F: 2nd shot at Metal · Jun 17, 2023
ik/k_quants

9c8536e5 · Had unintentionally committed the Makefile with -Ofast enabled · Jun 05, 2023
fix_clblast

20054a38 · Fix directory name · May 27, 2023
cmake-change

75649f44 · Change CMake files · May 24, 2023
fix-benchmark-matmult

b16c085c · examples : fix benchmark-matmult · May 21, 2023
chunks

a1cdd29c · ggml : rms_norm in chunks · May 20, 2023
steering

95dc4d72 · Merge 'origin/master' into steering · May 19, 2023
f16c

40ec4882 · ggml : use F16C conversion when available · May 17, 2023
dequantize-matmul-3-gg

a3e6d622 · cuda : alternative q4_q8 kernel · May 12, 2023
remove-vzip

e116eb63 · ggml : speed-up Q5_0 + Q5_1 at 4 threads · May 11, 2023
jed/spm-clblast

4baa8563 · Fix build · May 06, 2023
ci_cublas

31ff9e2e · ci : add cublas to windows release · May 03, 2023
q4_3-range-fix

102cd980 · ggml : Q4_3c using 2x "Full range" approach · Apr 23, 2023
q4_0-q4_2-range-fix

71e6ae37 · ggml : continue from #729 (wip) · Apr 22, 2023
gg/rmse_quantization

a0242a83 · Minor, plus rebase on master · Apr 22, 2023
quant-attn

4b8d5e38 · llama : quantize attention results · Apr 22, 2023
mmap-pages-stats

15067374 · Add mmap pages stats (disabled by default) · Apr 16, 2023

