QuantizeΒΆ
Quantize the sub-modules of a module according to a predicate.
Abstract Signature:
Quantize(model: Module, group_size: int = 64, bits: int = 4)
JAX (Core)
API:
βStrategy: Plugin (quantization)
Keras
API:
keras.layers.ZeroPadding3D.quantizeStrategy: Plugin (quantization)
TensorFlow
API:
βStrategy: Plugin (quantization)
Flax NNX
API:
βStrategy: Plugin (quantization)
PaxML / Praxis
API:
βStrategy: Plugin (quantization)