Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/ggerganov/llama.cpp
. Pull mirroring updated
Sep 20, 2024
.
llama-refactor
afb39292
·
Merge branch 'master' into llama-refactor
·
Oct 31, 2023
llm-reuse-constants
7420bef8
·
wip wip wip
·
Nov 01, 2023
llm-build-context
a8796f96
·
llm : cleanup + comments
·
Nov 01, 2023
batched-krn
1354122c
·
fix warnings
·
Nov 01, 2023
metal-soft-max
46868a49
·
metal : multi-simd softmax
·
Nov 01, 2023
fix-metal-after-yarn
396412c0
·
metal : fix build errors and kernel sig after #2268
·
Nov 02, 2023
remove-ggufv1
347d587a
·
gguf : remove special-case code for GGUFv1
·
Nov 02, 2023
disable-cmake-native
c217a66b
·
disable LLAMA_NATIVE by default
·
Nov 02, 2023
gguf-grace
f3069478
·
gguf : print error for GGUFv1 files
·
Nov 02, 2023
fix-cuda-warnings
d1a1678b
·
Merge branch 'master' into fix-cuda-warnings
·
Nov 02, 2023
cuda-dmmv-dims
166e44b7
·
ggml-cuda : move row numbers to x grid dim in mmv kernels
·
Nov 03, 2023
yet-another-yarn-fix
88ff0e39
·
common : YAYF (yet another YARN fix)
·
Nov 03, 2023
revert-pool
3ef358ff
·
Revert "cuda : use CUDA memory pool with async memory allocation/deallocation...
·
Nov 04, 2023
fix-tensor-split-zero
47d604fa
·
fix issues
·
Nov 05, 2023
fix-libllava-ld
1b723a8d
·
make : do not add linker flags when compiling static llava lib
·
Nov 07, 2023
backend-alloc-v-fix
3469f5a9
·
ggml-alloc : fix backend assignments of views
·
Nov 08, 2023
llama-metadata
d0445a2e
·
better documentation
·
Nov 10, 2023
fix-gguf-convert-endian
b226d07d
·
Bump version and upd description
·
Nov 11, 2023
fix-llava-regression-sq-img
59254360
·
llava : fix regression for square images in #3613
·
Nov 13, 2023
ceb/restore-prefix-space
5e899428
·
do not add space prefix if the first token is special
·
Nov 14, 2023
Prev
1
…
8
9
10
11
12
13
14
15
16
…
26
Next