AttentionProjectionΒΆ
Layer that computes multi-head projection for dot-product attention.
Abstract Signature:
AttentionProjection(input_dim: int, num_heads: int, dim_per_head: int, is_output: bool = False)
Layer that computes multi-head projection for dot-product attention.
Abstract Signature:
AttentionProjection(input_dim: int, num_heads: int, dim_per_head: int, is_output: bool = False)