|
1 |
| -Sagemaker Distributed Model Parallel 1.6.0 Release Notes |
| 1 | +############# |
| 2 | +Release Notes |
| 3 | +############# |
| 4 | + |
| 5 | +New features, bug fixes, and improvements are regularly made to the SageMaker |
| 6 | +distributed model parallel library. |
| 7 | + |
| 8 | +Sagemaker Distributed Model Parallel 1.7.0 Release Notes |
2 | 9 | ========================================================
|
3 | 10 |
|
| 11 | +*Date: March. 07. 2022* |
| 12 | + |
| 13 | +**New Features** |
| 14 | + |
| 15 | +**Currency Updates** |
| 16 | + |
| 17 | +* Support for PyTorch 1.10.2 |
| 18 | +* Support for Hugging Face Transformers 4.16.2 |
| 19 | + |
| 20 | +**New Features** |
| 21 | + |
| 22 | +* Support for query key layer scaling to avoid overflow for large model |
| 23 | +* Support for FP32 residual addition to avoid overflow (NaN loss values) for large models when using FP16. |
| 24 | + |
| 25 | +**Improvements** |
| 26 | + |
| 27 | +* Added support for a custom CUDA kernel for softmax to improve throughput |
| 28 | +* Added support for the following `NVIDIA Megatron’s fused kernels |
| 29 | + <https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/fused_kernels>`_: |
| 30 | + |
| 31 | + * Fusion of attention masking and softmax |
| 32 | + * Fusion of bias addition and Gelu activation |
| 33 | + |
| 34 | +**Migration to AWS Deep Learning Containers** |
| 35 | + |
| 36 | +This version passed benchmark testing and is migrated to the following AWS Deep Learning Containers: |
| 37 | + |
| 38 | + |
| 39 | +* PyTorch 1.10.2 |
| 40 | + |
| 41 | + .. code:: |
| 42 | +
|
| 43 | + 763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.10.2-gpu-py38-cu113-ubuntu20.04-sagemaker |
| 44 | +
|
| 45 | +
|
| 46 | +---- |
| 47 | + |
| 48 | +Release History |
| 49 | +=============== |
| 50 | + |
| 51 | +Sagemaker Distributed Model Parallel 1.6.0 Release Notes |
| 52 | +-------------------------------------------------------- |
| 53 | + |
4 | 54 | *Date: December. 20. 2021*
|
5 | 55 |
|
6 | 56 | **New Features**
|
@@ -30,10 +80,7 @@ AWS Deep Learning Container(s):
|
30 | 80 |
|
31 | 81 | 763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.8.1-gpu-py36-cu111-ubuntu18.04
|
32 | 82 |
|
33 |
| ----- |
34 | 83 |
|
35 |
| -Release History |
36 |
| -=============== |
37 | 84 |
|
38 | 85 | Sagemaker Distributed Model Parallel 1.5.0 Release Notes
|
39 | 86 | --------------------------------------------------------
|
|
0 commit comments