AttentionProjection

Layer that computes the per-head linear projections (query, key, value, or, with is_output=True, the combined output projection) used in multi-head dot-product attention.

Abstract Signature:

    AttentionProjection(input_dim: int, num_heads: int, dim_per_head: int, is_output: bool = False)
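The shape semantics implied by this signature can be sketched in a few lines of NumPy (the shared per-head weight and the einsum layout below are illustrative assumptions, not a prescribed implementation): an input projection maps [..., input_dim] to [..., num_heads, dim_per_head], and an output projection (is_output=True) maps the per-head activations back to input_dim.

    import numpy as np

    input_dim, num_heads, dim_per_head = 16, 4, 8
    w = np.random.randn(input_dim, num_heads, dim_per_head)  # one weight slice per head

    # Input projection: [..., input_dim] -> [..., num_heads, dim_per_head]
    x = np.random.randn(2, 5, input_dim)                     # [batch, seq, input_dim]
    q = np.einsum('...D,DNH->...NH', x, w)
    assert q.shape == (2, 5, num_heads, dim_per_head)

    # Output projection (is_output=True): [..., num_heads, dim_per_head] -> [..., input_dim]
    ctx = np.random.randn(2, 5, num_heads, dim_per_head)
    out = np.einsum('...NH,DNH->...D', ctx, w)
    assert out.shape == (2, 5, input_dim)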

PyTorch

API: torch.nn.Linear
Strategy: Direct Mapping
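
A minimal sketch of how the direct mapping might look in PyTorch, assuming the common pattern of a single Linear that produces all heads at once, with reshapes supplying the head dimension (the shape constants are arbitrary):

    import torch
    import torch.nn as nn

    input_dim, num_heads, dim_per_head = 16, 4, 8

    # Input projection: one Linear for all heads, then split the head dimension.
    proj = nn.Linear(input_dim, num_heads * dim_per_head)
    x = torch.randn(2, 5, input_dim)                          # [batch, seq, input_dim]
    q = proj(x).view(2, 5, num_heads, dim_per_head)           # [batch, seq, heads, dim]

    # Output projection (is_output=True): merge heads, then project back to input_dim.
    out_proj = nn.Linear(num_heads * dim_per_head, input_dim)
    out = out_proj(q.reshape(2, 5, num_heads * dim_per_head)) # [batch, seq, input_dim]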

Flax NNX

API: flax.nnx.Linear
Strategy: Direct Mapping
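
The same sketch in Flax NNX, under the same assumptions; note that nnx.Linear takes an explicit rngs argument for parameter initialization:

    import jax.numpy as jnp
    from flax import nnx

    input_dim, num_heads, dim_per_head = 16, 4, 8
    rngs = nnx.Rngs(0)

    # Input projection: one Linear for all heads, then split the head dimension.
    proj = nnx.Linear(input_dim, num_heads * dim_per_head, rngs=rngs)
    x = jnp.ones((2, 5, input_dim))                           # [batch, seq, input_dim]
    q = proj(x).reshape(2, 5, num_heads, dim_per_head)        # [batch, seq, heads, dim]

    # Output projection (is_output=True): merge heads, then project back to input_dim.
    out_proj = nnx.Linear(num_heads * dim_per_head, input_dim, rngs=rngs)
    out = out_proj(q.reshape(2, 5, num_heads * dim_per_head)) # [batch, seq, input_dim]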

PaxML / Praxis

API: praxis.layers.AttentionProjection
Strategy: Direct Mapping
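
A hedged sketch of configuring and running the Praxis layer; the field name is_output_projection (Praxis's spelling of the abstract is_output flag) and the pax_fiddle/Flax init-apply plumbing reflect common Praxis conventions and should be verified against the installed version:

    import jax
    import jax.numpy as jnp
    from praxis import base_layer, layers, pax_fiddle

    # Assumed field names; Praxis spells the output flag `is_output_projection`.
    proj_p = pax_fiddle.Config(
        layers.AttentionProjection,
        name='q_proj',
        input_dim=16,
        num_heads=4,
        dim_per_head=8,
        is_output_projection=False,
    )
    proj = base_layer.instantiate(proj_p)

    x = jnp.ones((2, 5, 16))                         # [batch, seq, input_dim]
    variables = proj.init(jax.random.PRNGKey(0), x)  # Praxis layers follow Flax init/apply
    q = proj.apply(variables, x)                     # [batch, seq, num_heads, dim_per_head]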