QuantizedAllToShardedLinearΒΆ

Quantized distributed linear layer.

Abstract Signature:

QuantizedAllToShardedLinear(in_features: int, out_features: int)

PyTorch

API: β€”
Strategy: Custom / Partial

Apple MLX

API: mlx.nn.QuantizedAllToShardedLinear
Strategy: Direct Mapping