Skip to content
GitLab
Explore
Sign in
Overview
Active
Stale
All
This project is mirrored from
https://github.com/huggingface/optimum-quanto
. Pull mirroring updated
Sep 16, 2024
.
Active branches
main
default
e0353a5d
·
fix(library): disable int_mm for CPU
·
Sep 16, 2024
disable_int_mm_cpu
0a832208
·
fix(library): disable int_mm for CPU
·
Sep 16, 2024
add_marlin_int4_kernel
6e888fda
·
fix(marlin): avoid kernel crash on H100
·
Sep 13, 2024
marlin_neural_magic_step_by_step
06629972
·
fix(marlin_: avoid kernel crash on H100
·
Sep 05, 2024
marlin_neural_magic
6751ecad
·
style: reapply NeuralMagic formatting
·
Sep 03, 2024
Show more active branches
Stale branches
quantized_wrapper
0dca8fa1
·
feat(quantize): quantize with QActivationWrapper
·
Nov 30, 2023
tracking_mode
117512c8
·
feat(calibration): identify and log fallbacks in debug mode
·
Dec 19, 2023
use_library
45a99a8c
·
feat(bench): allow to disable extensions in generation bench
·
Feb 19, 2024
mixed_mm
c116e343
·
refactor(tensor): remove dispatch args
·
Feb 22, 2024
release-v0.1.0
fe2b3139
·
release: 0.1.0
·
Mar 13, 2024
Show more stale branches