Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/ggerganov/llama.cpp
. Pull mirroring updated
Sep 19, 2024
.
zig-build
6fc2847a
·
simplify object var names
·
Aug 04, 2023
ggml-allocator-metal-fix
74fb31bd
·
move asserts
·
Jul 30, 2023
gguf-write-tensor
51105572
·
undo formatting
·
Jul 28, 2023
ggml-allocator
122d7c65
·
allow using the allocator with opencl
·
Jul 27, 2023
gguf-python
af1c9966
·
gguf : start write tensor info
·
Jul 27, 2023
fix-concurrency
1038d1d2
·
metal : fix out-of-bounds access + style changes
·
Jul 27, 2023
ggml-fix-set-unary
f9c3a3fd
·
ggml : fix assert in ggml_set_unary_op
·
Jul 26, 2023
ggml-ctx-graph
1b4fd4e0
·
cleanup
·
Jul 26, 2023
flash-attn-params
e25e15c9
·
fix
·
Jul 25, 2023
ik/llama_dfault_rms_eps
055bee91
·
Add LLAMA_DEFAULT_RMS_EPS so we can change the default
·
Jul 25, 2023
mul-mat-tweaks
450a7c76
·
ggml : mul_mat threads yield
·
Jul 25, 2023
server-eps
3d4359e2
·
server: add rms_norm_eps parameter
·
Jul 24, 2023
ik/metal_q4_0_1_new
7f985612
·
Have N_DST, etc., be template parameters
·
Jul 24, 2023
webchat-escape-html
27d0fcc3
·
Merge remote-tracking branch 'origin/master' into webchat-escape-html
·
Jul 24, 2023
rms-norm-eps-param
3855ea36
·
use scientific notation for eps param in the help
·
Jul 24, 2023
sync
68c9fca9
·
tests : remove unnecessary funcs
·
Jul 24, 2023
ik/fix_scalar_q5k_64
b32538da
·
Fix scalar version of Q5_K when QK_K = 64
·
Jul 24, 2023
ik/cuda_fix_QKK_64_2
e6dd6bc5
·
Very slightly better Q5_K bit fiddling
·
Jul 24, 2023
ik/cuda_q5k
f3a92117
·
Add some comments to satisfy PR reviewer
·
Jul 23, 2023
ik/cuda_fix_QKK_64
8b44eef2
·
Some cleanup
·
Jul 23, 2023
Prev
1
…
19
20
21
22
23
24
25
26
Next