num_heads
Auto-generated from flax_nnx_code_defs
PyTorch
API: torch.nn.modules.activation.MultiheadAttention.num_heads
Strategy: Direct Mapping
Apple MLX
API: mlx.nn.layers.transformer.MultiHeadAttention.num_heads
Strategy: Direct Mapping
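
As a rough illustration of what "Direct Mapping" appears to mean for this entry, the sketch below passes the head count through unchanged and only looks up the target API path. The helper `map_num_heads` and the `TARGET_API` table are hypothetical names introduced for this example and belong to none of the libraries; the two API paths are copied from the entries above. It assumes the value needs no conversion, which is how the strategy label reads, not something this page states explicitly.

```python
# Hypothetical lookup table: the paths are copied from the entries above,
# the table itself is not part of any library.
TARGET_API = {
    "pytorch": "torch.nn.modules.activation.MultiheadAttention.num_heads",
    "mlx": "mlx.nn.layers.transformer.MultiHeadAttention.num_heads",
}


def map_num_heads(num_heads: int, target: str) -> tuple[str, int]:
    """Return the target API path and the head count, passed through unchanged.

    Assumption: "Direct Mapping" means no renaming or value conversion.
    """
    if target not in TARGET_API:
        raise ValueError(f"unknown target framework: {target!r}")
    if not isinstance(num_heads, int) or num_heads <= 0:
        raise ValueError("num_heads must be a positive integer")
    return TARGET_API[target], num_heads


if __name__ == "__main__":
    for target in TARGET_API:
        path, value = map_num_heads(8, target)
        print(f"{target}: {path} = {value}")
```

The validation step is a design choice for the sketch only; each framework performs its own checks when the attention layer is constructed.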