Skip to content
GitLab
Explore
Sign in
Tags
Tags give the ability to mark specific points in history as being important
This project is mirrored from
https://github.com/ggerganov/llama.cpp
. Pull mirroring updated
Sep 19, 2024
.
b3667
581c3051
·
ggml : AVX2 support for Q4_0_8_8 (#8713)
·
Sep 04, 2024
b3666
5910ea94
·
[SYCL] Fix DMMV dequantization (#9279)
·
Sep 04, 2024
b3664
82e3b03c
·
rpc : make RPC servers come first in the device list (#9296)
·
Sep 04, 2024
b3661
8962422b
·
llama-bench : add JSONL (NDJSON) output mode (#9288)
·
Sep 03, 2024
b3658
f1485161
·
src: make tail invalid when kv cell is intersection for mamba (#9249)
·
Sep 02, 2024
b3656
f771d064
·
ggml : add pthread includes on FreeBSD (#9258)
·
Sep 02, 2024
b3655
6e7d133a
·
server : refactor multitask handling (#9274)
·
Sep 02, 2024
b3654
b60074f1
·
llama-cli : remove duplicated log message (#9275)
·
Sep 02, 2024
b3652
c6d4cb46
·
llama : minor style
·
Sep 02, 2024
b3651
8f1d81a0
·
llama : support RWKV v6 models (#8980)
·
Sep 01, 2024
b3649
ea5d7478
·
sgemm : improved Q4_0 and Q8_0 performance via 4xN and Mx4 gemm (#8908)
·
Aug 31, 2024
b3647
0ab30f8d
·
llama : fix llama_split_mode enum values in main_gpu document (#9057)
·
Aug 30, 2024
b3645
7ea8d80d
·
llava : the function "clip" should be int (#9237)
·
Aug 30, 2024
b3644
42c76d13
·
Threadpool: take 2 (#8672)
·
Aug 30, 2024
b3643
9f7d4bcf
·
server : fix crash when error handler dumps invalid utf-8 json (#9195)
·
Aug 30, 2024
b3639
20f1789d
·
vulkan : fix build (#0)
·
Aug 27, 2024
b3636
78eb487b
·
llama : fix qs.n_attention_wv for DeepSeek-V2 (#9156)
·
Aug 27, 2024
b3635
a77feb5d
·
server : add some missing env variables (#9116)
·
Aug 27, 2024
b3634
2e59d61c
·
llama : fix ChatGLM4 wrong shape (#9194)
·
Aug 27, 2024
b3633
75e1dbba
·
llama : fix llama3.1 rope_freqs not respecting custom head_dim (#9141)
·
Aug 27, 2024
Prev
1
2
3
4
5
6
7
8
9
…
122
Next