AdditiveAttentionΒΆ
Additive attention layer, a.k.a. Bahdanau-style attention.
Abstract Signature:
AdditiveAttention(use_scale: bool = True, dropout: float = 0.0)
PyTorch
API:
βStrategy: Custom / Partial
JAX (Core)
API:
βStrategy: Custom / Partial
Apple MLX
API:
βStrategy: Custom / Partial
PaxML / Praxis
API:
βStrategy: Custom / Partial