LanguageModelContinuousBatchingΒΆ
Language model that uses continuous batching.
PyTorch
API:
βStrategy: Custom / Partial
PaxML / Praxis
API:
paxml.layers.LanguageModelContinuousBatchingStrategy: Direct Mapping