This project is mirrored from https://github.com/SJTU-IPADS/PowerInfer.git. Pull mirroring updated Sep 19, 2024.
Active branches
main (default) · 6ae7e06d · Fix segmentation fault for models exceeding 40B on AMD GPUs & optimize... · Sep 06, 2024
convert-sparsemistral · 5b02459b · support converting TurboSparse mistral model which embeds MLP in Pytorch tensors · Jul 09, 2024
rocm-readme · a7cfb543 · minor for readme · Jun 27, 2024
Stale branches
fix/vram-budget-inaccuracy (diverged from upstream) · 4d80abd3 · wip: disable vram budget hard limit temporarily · Dec 28, 2023
model-mistral · f270a288 · support dense Mistral model · Feb 05, 2024
144-cmake-317-or-higher-is-required-the-repository-asks-for-version-3134 · 764347f2 · Fix CMake requirement in README · Feb 18, 2024
fix-compile-worktree · abf4aa93 · Fix compiling issue under git worktrees · Feb 20, 2024
fix/cuda-warning-options · 8edcb46c · fix: cuda host compiler options at wrong position · Mar 07, 2024