AttentionLayerΒΆ
Dot-product attention layer, a.k.a. Luong-style attention.
Abstract Signature:
AttentionLayer(use_scale: bool = False, score_mode: str = dot, dropout: float = 0.0, seed: int)
JAX (Core)
API:
βStrategy: Custom / Partial
Apple MLX
API:
βStrategy: Custom / Partial
PaxML / Praxis
API:
βStrategy: Custom / Partial