Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/huggingface/optimum-quanto
. Pull mirroring updated
Sep 19, 2024
.
hqq_optimizer
8ae0ceb7
·
feat(optimizers): add HQQ optimizer
·
Apr 05, 2024
ci_check_commits
e4d5fe39
·
docs: update contributing
·
Apr 02, 2024
refactor_tensors
1c407479
·
wip
·
Mar 29, 2024
optimizers
f2834628
·
refactor: introduce optimizers
·
Mar 25, 2024
awq
a42717e5
·
feat(bench): add AWQ kernels benchmark
·
Mar 21, 2024
benchmark_libs
7a0bf7e1
·
wip
·
Mar 20, 2024
fix-serialization
d528a3fc
·
style
·
Mar 19, 2024
benchmark_readme
a2aa0276
·
doc(generation): add more charts to README
·
Mar 15, 2024
try-4bit-mm
c967cff3
·
feat(udqmm): add c++ and python implementation
·
Mar 14, 2024
release-v0.1.0
fe2b3139
·
release: 0.1.0
·
Mar 13, 2024
mixed_mm
c116e343
·
refactor(tensor): remove dispatch args
·
Feb 22, 2024
use_library
45a99a8c
·
feat(bench): allow to disable extensions in generation bench
·
Feb 19, 2024
tracking_mode
117512c8
·
feat(calibration): identify and log fallbacks in debug mode
·
Dec 19, 2023
quantized_wrapper
0dca8fa1
·
feat(quantize): quantize with QActivationWrapper
·
Nov 30, 2023
Prev
1
2
3
4
Next