Commit 4cb56d6

committed
fix doc structure
1 parent 92d0627 commit 4cb56d6

3 files changed: 54 additions and 24 deletions

doc/api/training/distributed.rst

Lines changed: 3 additions & 0 deletions
@@ -26,3 +26,6 @@ The SageMaker Distributed Model Parallel Library
 :maxdepth: 3

 smd_model_parallel
+smp_versions/latest
+smd_model_parallel_general
+smd_model_parallel_release_notes/smd_model_parallel_change_log

doc/api/training/smd_model_parallel.rst

Lines changed: 0 additions & 20 deletions
@@ -9,15 +9,6 @@ allowing you to increase prediction accuracy by creating larger models with more
 You can use the library to automatically partition your existing TensorFlow and PyTorch workloads
 across multiple GPUs with minimal code changes. The library's API can be accessed through the Amazon SageMaker SDK.

-See the following sections to learn more about the SageMaker model parallel library APIs.
-
-.. toctree::
-:maxdepth: 3
-
-smp_versions/latest
-smd_model_parallel_general
-
-
 .. tip::

 We recommended using this API documentation with the conceptual guide at
@@ -48,14 +39,3 @@ See the following sections to learn more about the SageMaker model parallel libr
 `Extend or Adapt A Docker Container that Contains the Model Parallel Library
 <https://integ-docs-aws.amazon.com/sagemaker/latest/dg/model-parallel-use-api.html#model-parallel-customize-container>`__
 for more information.
-
-Release Notes
-=============
-
-New features, bug fixes, and improvements are regularly made to the SageMaker
-distributed model parallel library.
-
-.. toctree::
-:maxdepth: 1
-
-smd_model_parallel_release_notes/smd_model_parallel_change_log

doc/api/training/smd_model_parallel_release_notes/smd_model_parallel_change_log.rst

Lines changed: 51 additions & 4 deletions
@@ -1,6 +1,56 @@
-Sagemaker Distributed Model Parallel 1.6.0 Release Notes
+#############
+Release Notes
+#############
+
+New features, bug fixes, and improvements are regularly made to the SageMaker
+distributed model parallel library.
+
+Sagemaker Distributed Model Parallel 1.7.0 Release Notes
 ========================================================

+*Date: March. 07. 2022*
+
+**New Features**
+
+**Currency Updates**
+
+* Support for PyTorch 1.10.2
+* Support for Hugging Face Transformers 4.16.2
+
+**New Features**
+
+* Support for query key layer scaling to avoid overflow for large model
+* Support for FP32 residual addition to avoid overflow (NaN loss values) for large models when using FP16.
+
+**Improvements**
+
+* Added support for a custom CUDA kernel for softmax to improve throughput
+* Added support for the following `NVIDIA Megatron’s fused kernels
+<https://github.com/NVIDIA/Megatron-LM/tree/main/megatron/fused_kernels>`_:
+
+* Fusion of attention masking and softmax
+* Fusion of bias addition and Gelu activation
+
+**Migration to AWS Deep Learning Containers**
+
+This version passed benchmark testing and is migrated to the following AWS Deep Learning Containers:
+
+
+* PyTorch 1.10.2
+
+.. code::
+
+763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.10.2-gpu-py38-cu113-ubuntu20.04-sagemaker
+
+
+----
+
+Release History
+===============
+
+Sagemaker Distributed Model Parallel 1.6.0 Release Notes
+--------------------------------------------------------
+
 *Date: December. 20. 2021*

 **New Features**
@@ -30,10 +80,7 @@ AWS Deep Learning Container(s):

 763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.8.1-gpu-py36-cu111-ubuntu18.04

-----

-Release History
-===============

 Sagemaker Distributed Model Parallel 1.5.0 Release Notes
 --------------------------------------------------------
