This project is mirrored from https://github.com/huggingface/optimum-quanto. Pull mirroring updated Sep 19, 2024.
quantized_wrapper · 0dca8fa1 · feat(quantize): quantize with QActivationWrapper · Nov 30, 2023
tracking_mode · 117512c8 · feat(calibration): identify and log fallbacks in debug mode · Dec 19, 2023
use_library · 45a99a8c · feat(bench): allow to disable extensions in generation bench · Feb 19, 2024
mixed_mm · c116e343 · refactor(tensor): remove dispatch args · Feb 22, 2024
release-v0.1.0 · fe2b3139 · release: 0.1.0 · Mar 13, 2024
try-4bit-mm · c967cff3 · feat(udqmm): add c++ and python implementation · Mar 14, 2024
benchmark_readme · a2aa0276 · doc(generation): add more charts to README · Mar 15, 2024
fix-serialization · d528a3fc · style · Mar 19, 2024
benchmark_libs · 7a0bf7e1 · wip · Mar 20, 2024
awq · a42717e5 · feat(bench): add AWQ kernels benchmark · Mar 21, 2024
optimizers · f2834628 · refactor: introduce optimizers · Mar 25, 2024
refactor_tensors · 1c407479 · wip · Mar 29, 2024
ci_check_commits · e4d5fe39 · docs: update contributing · Apr 02, 2024
hqq_optimizer · 8ae0ceb7 · feat(optimizers): add HQQ optimizer · Apr 05, 2024
gpu_ci · 99c4cfc2 · ci(cuda): install CUDA toolkit to build extension · Apr 10, 2024
macos_ci · d2f9a45d · fix: add missing setuptools dependency · Apr 11, 2024
add_licence_headers · 18e3035f · chore: add license headers in cpp files · Apr 12, 2024
awq_packing · 5507c9cb · wip · Apr 13, 2024
stale_bot · 3703469b · ci: add stale bot · Apr 15, 2024
awq_kernels · 70d081c3 · feat(awq): add library and CUDA extension · Apr 16, 2024