DeepSpeed
Model Setup
Training Setup
Argument Parsing
Training Initialization
Distributed Initialization
Inference Setup
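As a quick orientation to the setup APIs listed above, here is a minimal training-setup sketch; the model is a placeholder and the DeepSpeed config is assumed to be supplied via --deepspeed_config on the command line. Inference Setup is sketched together with the Inference API further below.

    import argparse
    import torch
    import deepspeed

    # Argument parsing: let DeepSpeed register its launcher/config arguments
    # (e.g. --deepspeed, --deepspeed_config) on an existing argparse parser.
    parser = argparse.ArgumentParser(description="my training script")
    parser.add_argument("--local_rank", type=int, default=-1)  # set by the launcher
    parser = deepspeed.add_config_arguments(parser)
    args = parser.parse_args()

    # Distributed initialization: optional to call explicitly, since
    # deepspeed.initialize() will set up the process group if needed.
    deepspeed.init_distributed()

    # Training initialization: wrap the model in a DeepSpeed engine.
    model = torch.nn.Linear(1024, 1024)  # placeholder model
    model_engine, optimizer, _, _ = deepspeed.initialize(
        args=args,
        model=model,
        model_parameters=model.parameters(),
    )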
Training API
Training API
Forward Propagation
Backward Propagation
Optimizer Step
Gradient Accumulation
Model Saving
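The training-loop calls listed above map onto the engine roughly as follows. This is a minimal sketch: model_engine comes from the setup sketch earlier, train_loader is a hypothetical DataLoader yielding (inputs, labels) pairs, and the model's forward is assumed to return the loss.

    for step, (inputs, labels) in enumerate(train_loader):
        inputs = inputs.to(model_engine.device)
        labels = labels.to(model_engine.device)

        # Forward propagation through the engine.
        loss = model_engine(inputs, labels)

        # Backward propagation: the engine applies loss scaling and handles
        # gradient accumulation according to the config.
        model_engine.backward(loss)

        # Optimizer step: weights are only updated at a gradient
        # accumulation boundary; otherwise this call is effectively a no-op.
        model_engine.step()

    # Model saving (see also the Checkpointing API section below).
    model_engine.save_checkpoint("checkpoints", tag="final")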
Inference API
Inference API
Forward Propagation
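A corresponding inference sketch, covering both the inference setup call and forward propagation. A CUDA device is assumed, and the keyword arguments shown are illustrative; see the Inference Setup page for the full list.

    import torch
    import deepspeed

    model = torch.nn.Linear(1024, 1024)  # placeholder; typically a trained transformer model

    # Inference setup: wrap the model in a DeepSpeed inference engine.
    engine = deepspeed.init_inference(model, dtype=torch.half)

    # Forward propagation goes through the engine like a regular module call.
    inputs = torch.randn(4, 1024, dtype=torch.half, device="cuda")
    outputs = engine(inputs)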
Checkpointing API
Model Checkpointing
Loading Training Checkpoints
Saving Training Checkpoints
ZeRO Checkpoint fp32 Weights Recovery
Activation Checkpointing
Configuring Activation Checkpointing
Using Activation Checkpointing
Configuring and Checkpointing Random Seeds
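A minimal sketch of the checkpointing calls above. model_engine is the engine from the earlier setup sketch; the paths, tags, custom_forward function and hidden tensor are hypothetical placeholders.

    import deepspeed
    from deepspeed.utils.zero_to_fp32 import get_fp32_state_dict_from_zero_checkpoint

    # Saving a training checkpoint: a collective call that every rank must
    # make; arbitrary extra state can be attached via client_state.
    model_engine.save_checkpoint("checkpoints", tag="step1000",
                                 client_state={"step": 1000})

    # Loading a training checkpoint: returns the checkpoint path and the
    # client_state dict that was saved with it.
    load_path, client_state = model_engine.load_checkpoint("checkpoints", tag="step1000")

    # ZeRO checkpoint fp32 weights recovery: consolidate the partitioned
    # ZeRO shards into a single fp32 state_dict (done offline, on CPU).
    state_dict = get_fp32_state_dict_from_zero_checkpoint("checkpoints")

    # Activation checkpointing: configure once, then wrap forward segments.
    # custom_forward and hidden are hypothetical stand-ins for part of the
    # model's forward pass and its input.
    deepspeed.checkpointing.configure(mpu_=None, partition_activations=True)
    hidden = deepspeed.checkpointing.checkpoint(custom_forward, hidden)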
ZeRO API
ZeRO
Getting Started
Constructing Massive Models
Manual Parameter Coordination
Memory-Centric Tiling
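A short sketch of the ZeRO construction and parameter-coordination utilities above, assuming a ZeRO stage 3 configuration and an initialized distributed environment.

    import torch
    import deepspeed

    # Constructing massive models: parameters are allocated in partitioned
    # (ZeRO-3) form as each submodule is built, so no single rank ever
    # holds the full model in memory.
    with deepspeed.zero.Init():
        model = torch.nn.Linear(8192, 8192)  # placeholder for a very large model

    # Manual parameter coordination: temporarily gather a partitioned
    # parameter when it must be accessed outside the engine, e.g. for
    # custom initialization performed on rank 0 and then re-partitioned.
    with deepspeed.zero.GatheredParameters(model.weight, modifier_rank=0):
        if torch.distributed.get_rank() == 0:
            torch.nn.init.xavier_uniform_(model.weight)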
Mixture of Experts (MoE)
Mixture of Experts (MoE)
Layer specification
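A layer-specification sketch for the MoE API above. The values are illustrative, and expert parallelism requires an initialized distributed environment.

    import torch
    import deepspeed
    from deepspeed.moe.layer import MoE

    deepspeed.init_distributed()  # MoE creates expert-parallel process groups

    hidden_size = 1024
    expert = torch.nn.Linear(hidden_size, hidden_size)  # placeholder expert network

    # Wrap the expert in an MoE layer with top-1 gating over 8 experts.
    moe_layer = MoE(hidden_size=hidden_size, expert=expert, num_experts=8, k=1)

    # The layer returns the output together with the auxiliary gating loss
    # and per-expert token counts.
    output, l_aux, exp_counts = moe_layer(torch.randn(4, hidden_size))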
Transformer Kernel API
Transformer Kernels
DeepSpeed Transformer Config
DeepSpeed Transformer Layer
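A sketch of constructing the fused transformer kernel from the entries above. Parameter names follow the DeepSpeed transformer-kernel tutorial and may vary between releases; a CUDA device and a working kernel build are assumed.

    from deepspeed import DeepSpeedTransformerConfig, DeepSpeedTransformerLayer

    # DeepSpeed Transformer Config: describes the shape and behaviour of the layer.
    config = DeepSpeedTransformerConfig(
        batch_size=8,
        hidden_size=1024,
        intermediate_size=4096,
        heads=16,
        attn_dropout_ratio=0.1,
        hidden_dropout_ratio=0.1,
        num_hidden_layers=24,
        initializer_range=0.02,
        fp16=True,
        pre_layer_norm=True,
    )

    # DeepSpeed Transformer Layer: a fused CUDA implementation of a
    # transformer encoder layer, built from the config above.
    layer = DeepSpeedTransformerLayer(config)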
Pipeline Parallelism
Pipeline Parallelism
Model Specification
Training
Extending Pipeline Parallelism
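A minimal pipeline-parallel sketch corresponding to the entries above. args is the parsed argument namespace from the setup sketch, and train_iter is a hypothetical iterator of (input, label) batches.

    import torch
    import deepspeed
    from deepspeed.pipe import PipelineModule, LayerSpec

    # Model specification: express the network as a sequence of layers;
    # LayerSpec defers construction until the owning pipeline stage builds it.
    net = PipelineModule(
        layers=[
            LayerSpec(torch.nn.Linear, 1024, 1024),
            LayerSpec(torch.nn.ReLU),
            LayerSpec(torch.nn.Linear, 1024, 10),
        ],
        num_stages=2,
        loss_fn=torch.nn.CrossEntropyLoss(),
    )

    # Training: the pipeline engine runs a whole pipeline schedule per call
    # instead of a manual forward/backward/step loop.
    engine, _, _, _ = deepspeed.initialize(
        args=args, model=net, model_parameters=net.parameters())
    loss = engine.train_batch(data_iter=train_iter)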
Optimizers
Optimizers
Adam (CPU)
FusedAdam (GPU)
FusedLamb (GPU)
OneBitAdam (GPU)
ZeroOneAdam (GPU)
OnebitLamb (GPU)
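The optimizers above are normally selected through the "optimizer" section of the DeepSpeed config; they can also be constructed directly. A sketch follows: the config fragment is illustrative rather than complete, and model is assumed to exist.

    # Config-driven selection (fragment of a DeepSpeed config, shown as a dict):
    ds_config = {
        "train_batch_size": 16,
        "optimizer": {
            "type": "OneBitAdam",          # or "Adam", "FusedLamb", ...
            "params": {
                "lr": 1e-4,
                "freeze_step": 400,        # warm-up steps before 1-bit compression
            },
        },
    }

    # Direct construction of the CPU and fused-GPU Adam variants:
    from deepspeed.ops.adam import DeepSpeedCPUAdam, FusedAdam
    cpu_adam = DeepSpeedCPUAdam(model.parameters(), lr=1e-4)
    fused_adam = FusedAdam(model.parameters(), lr=1e-4)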
Learning Rate Schedulers
Learning Rate Schedulers
LRRangeTest
OneCycle
WarmupLR
WarmupDecayLR
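Schedulers are likewise usually chosen in the "scheduler" section of the config, or constructed directly and passed to deepspeed.initialize. A sketch with illustrative values; optimizer is assumed to exist.

    # Config-driven selection (fragment):
    ds_config = {
        "scheduler": {
            "type": "WarmupLR",
            "params": {
                "warmup_min_lr": 0.0,
                "warmup_max_lr": 1e-4,
                "warmup_num_steps": 1000,
            },
        },
    }

    # Direct construction:
    from deepspeed.runtime.lr_schedules import WarmupDecayLR
    scheduler = WarmupDecayLR(optimizer,
                              total_num_steps=10000,
                              warmup_max_lr=1e-4,
                              warmup_num_steps=1000)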
Flops Profiler
Flops Profiler
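A sketch of profiling a model with the flops profiler's high-level entry point; argument names beyond the ones shown may differ between releases.

    import torch
    from deepspeed.profiling.flops_profiler import get_model_profile

    model = torch.nn.Linear(1024, 1024)  # placeholder model

    # Returns total FLOPs, MACs and parameter count, and optionally prints
    # a per-module breakdown.
    flops, macs, params = get_model_profile(model=model,
                                            input_shape=(4, 1024),
                                            print_profile=True,
                                            detailed=True)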
Autotuning
Autotuning
Autotuner
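The autotuner is driven from the deepspeed launcher together with an "autotuning" block in the config. A sketch of the config fragment; keys beyond "enabled" are illustrative.

    # Launch, for example:
    #   deepspeed --autotuning run train.py --deepspeed ds_config.json
    ds_config = {
        "autotuning": {
            "enabled": True,
            "fast": True,   # tune a reduced search space for quicker results
        },
    }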
Memory Usage
Memory Requirements
API To Estimate Memory Usage
Discussion
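A sketch of the memory-estimation helpers referenced above, here for ZeRO stage 3; the model is a placeholder, and the linked page documents similar helpers for the other ZeRO stages.

    import torch
    from deepspeed.runtime.zero.stage3 import estimate_zero3_model_states_mem_needs_all_live

    model = torch.nn.Linear(4096, 4096)  # placeholder; use the real model here

    # Prints estimated per-GPU / per-node memory needs for model states
    # (parameters, gradients, optimizer states) under ZeRO-3, with and
    # without CPU offload, for the given cluster shape.
    estimate_zero3_model_states_mem_needs_all_live(model,
                                                   num_gpus_per_node=8,
                                                   num_nodes=1)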
Indices and tables
Index
Module Index
Search Page