LanguageModelContinuousBatching

Language model that uses continuous batching.

PyTorch

API: none
Strategy: Custom / Partial
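
Since no PyTorch API is listed, the scheduling logic would have to be written by hand. The following is a minimal sketch of the general continuous-batching idea in plain Python, not the Praxis implementation: a fixed number of decode slots is kept busy, and a slot freed by a finished sequence is refilled from the waiting queue on the next step. All names here (`Request`, `toy_next_token`, `continuous_batching`, `EOS`) are hypothetical, and `toy_next_token` merely stands in for a real model forward pass.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

EOS = 0  # hypothetical end-of-sequence token id (assumption for this sketch)


@dataclass
class Request:
    request_id: int
    prompt: List[int]  # assumed non-empty
    max_new_tokens: int
    generated: List[int] = field(default_factory=list)

    def done(self) -> bool:
        # Finished on EOS or when the generation budget is exhausted.
        hit_eos = bool(self.generated) and self.generated[-1] == EOS
        return hit_eos or len(self.generated) >= self.max_new_tokens


def toy_next_token(tokens: List[int]) -> int:
    """Stand-in for a model step: counts down from the last token to EOS."""
    return max(tokens[-1] - 1, EOS)


def continuous_batching(
    requests: List[Request],
    num_slots: int,
    next_token: Callable[[List[int]], int] = toy_next_token,
) -> Tuple[Dict[int, List[int]], int]:
    """Decode all requests; return (outputs by request id, decode steps used)."""
    waiting = deque(requests)
    slots: List[Optional[Request]] = [None] * num_slots
    finished: Dict[int, List[int]] = {}
    steps = 0

    while waiting or any(s is not None for s in slots):
        # Refill free slots from the queue before each decode step.
        for i in range(num_slots):
            if slots[i] is None and waiting:
                slots[i] = waiting.popleft()

        # One decode step: every occupied slot produces one token.
        for i, req in enumerate(slots):
            if req is None:
                continue
            req.generated.append(next_token(req.prompt + req.generated))
            if req.done():
                finished[req.request_id] = req.generated
                slots[i] = None  # slot becomes available on the next step
        steps += 1

    return finished, steps


if __name__ == "__main__":
    reqs = [Request(0, [3], 8), Request(1, [1], 8), Request(2, [2], 8)]
    outputs, steps = continuous_batching(reqs, num_slots=2)
    print(outputs, steps)
```

With two slots and these three toy requests, the third request takes over the second slot as soon as the shortest request finishes, so the run needs 3 decode steps. Processing the same requests as two fixed batches would need 5. A real PyTorch port would additionally have to manage per-slot key/value cache state, which this sketch leaves out.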

PaxML / Praxis

API: paxml.layers.LanguageModelContinuousBatching
Strategy: Direct Mapping