This project is mirrored from https://github.com/ggerganov/llama.cpp. Pull mirroring updated Sep 19, 2024.
fix-refact · acead654 · Merge branch 'master' into fix-refact · Oct 08, 2023
fix-kv-cache-access · ee268b54 · llama : no longer perform uninitialized access to the KV cache · Oct 08, 2023
fix-metal-mul-mm · fdd5ad9a · metal : do not use mul_mm kernels when ne00 < 64 · Oct 08, 2023
alloc-assert-fix · ee745692 · ggml-alloc : fix assert in debug builds · Oct 09, 2023
batched-bench · 2fcdf869 · batched-bench : add mmq CLI arg · Oct 11, 2023
fix-server-kv-cache-manage · 058e83ca · server : fix kv cache management · Oct 12, 2023
llava · 0bd7e69d · do not use Wno-cast-qual for MSVC · Oct 12, 2023
rev-sampling · 5261aee8 · sampling : one sequence per sampling context · Oct 12, 2023
ggml-enum-finetune-fix · a85229c4 · ggml : add context enumeration functions · Oct 12, 2023
ttfs-alloc-fix · 32fe1a58 · train-text-from-scratch : fix assert failure in ggml-alloc · Oct 13, 2023
llava-fix-offloading · 932589c0 · Honor -ngl option for Cuda offloading in llava · Oct 14, 2023
fix-llava · 20131fef · set seed · Oct 16, 2023
fix-k-quants · 317dc4bc · k-quants : fix quantization ranges · Oct 16, 2023
fix-save-load-state · b3838fe0 · ci : add test for save-load-state example · Oct 17, 2023
fix-cuda-embeddings · 1c9c215f · fix embeddings when using CUDA · Oct 17, 2023
speculative-tree · ad2727d0 · Merge branch 'master' into speculative-tree · Oct 18, 2023
fix-segfault-nonexistent · 50093895 · Better error handling to avoid segfaults for non-existant CLIP models · Oct 19, 2023
bakllava · cd6f2180 · multimodal : add BakLLava conversion support · Oct 19, 2023
sampling-refactor · 56ba00b9 · sampling : hide prev behind API and apply #3661 · Oct 20, 2023
perf-study · cb79f8a2 · llama : add SKIP_KQ_KQV option · Oct 22, 2023