Last month, the DeepSpeed Team announced ZeRO-Infinity, a step forward in training models with tens of trillions of parameters. In addition to creating optimizations for scale, our team strives to introduce features that also improve speed, cost, and usability. As the DeepSpeed optimization library evolves, we are listening to the growing DeepSpeed community to learn […]
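For readers unfamiliar with what enabling ZeRO-Infinity looks like in practice, below is a minimal, illustrative sketch (not taken from the announcement itself) of a DeepSpeed configuration that turns on ZeRO stage 3 with NVMe offload, the mechanism ZeRO-Infinity uses to push model scale past GPU memory. The toy model, batch size, learning rate, and the /local_nvme path are placeholder assumptions, not values from the source.

    # Minimal sketch: ZeRO stage 3 with NVMe offload (ZeRO-Infinity).
    # All concrete values below (batch size, lr, nvme_path) are illustrative.
    import torch
    import deepspeed

    ds_config = {
        "train_micro_batch_size_per_gpu": 1,
        "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
        "zero_optimization": {
            "stage": 3,
            "offload_optimizer": {"device": "nvme", "nvme_path": "/local_nvme"},
            "offload_param": {"device": "nvme", "nvme_path": "/local_nvme"},
        },
    }

    model = torch.nn.Linear(4096, 4096)  # stand-in for a real model
    engine, optimizer, _, _ = deepspeed.initialize(
        model=model,
        model_parameters=model.parameters(),
        config=ds_config,
    )

Launched through the usual deepspeed launcher, a script like this shards parameters and optimizer state across workers and spills them to NVMe, trading I/O bandwidth for capacity.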
Announcing the DeepSpeed4Science Initiative: Enabling large-scale scientific discovery through sophisticated AI system technologies - Microsoft Research
TensorFlow to PyTorch for SLEAP: Is it Worth it?
Improving Pre-trained Language Models
DeepSpeed Compression: A composable library for extreme compression and zero-cost quantization - Microsoft Research
Optimization Strategies for Large-Scale DL Training Workloads: Case Study with RN50 on DGX Clusters
LLM (Part 12): Exploring DeepSpeed Inference optimizations for LLM inference - Zhihu
DeepSpeed: Accelerating large-scale model inference and training via system optimizations and compression - Microsoft Research
DeepSpeed/README.md at master · microsoft/DeepSpeed · GitHub
Comparison of test and training time for the benchmark network
ZeRO-Infinity and DeepSpeed: Unlocking unprecedented model scale for deep learning training - Microsoft Research