Skip to content

Commit 21c1ccf

Browse files
committed
currency updates for the smdistributed libraries
1 parent b16630b commit 21c1ccf

File tree

3 files changed

+54
-9
lines changed

3 files changed

+54
-9
lines changed

doc/api/training/sdp_versions/latest.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,8 +26,8 @@ depending on the version of the library you use.
2626
<https://docs.aws.amazon.com/sagemaker/latest/dg/data-parallel-use-api.html#data-parallel-use-python-skd-api>`_
2727
for more information.
2828

29-
Version 1.4.0, 1.4.1 (Latest)
30-
=============================
29+
Version 1.4.0, 1.4.1, 1.5.0 (Latest)
30+
====================================
3131

3232
.. toctree::
3333
:maxdepth: 1

doc/api/training/smd_data_parallel_release_notes/smd_data_parallel_change_log.rst

Lines changed: 40 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -7,9 +7,45 @@ Release Notes
77
New features, bug fixes, and improvements are regularly made to the SageMaker
88
distributed data parallel library.
99

10-
SageMaker Distributed Data Parallel 1.4.1 Release Notes
10+
SageMaker Distributed Data Parallel 1.5.0 Release Notes
1111
=======================================================
1212

13+
*Date: Jul. 26. 2022*
14+
15+
**Currency Updates**
16+
17+
* Added support for PyTorch 1.12.0.
18+
19+
**Bug Fixes**
20+
21+
* Improved stability for long-running training jobs.
22+
23+
24+
**Migration to AWS Deep Learning Containers**
25+
26+
This version passed benchmark testing and is migrated to the following AWS Deep Learning Containers (DLC):
27+
28+
- PyTorch 1.12.0 DLC
29+
30+
.. code::
31+
32+
763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.12.0-gpu-py38-cu113-ubuntu20.04-sagemaker
33+
34+
Binary file of this version of the library for custom container users:
35+
36+
.. code::
37+
38+
https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.12.0/cu113/2022-07-01/smdistributed_dataparallel-1.5.0-cp38-cp38-linux_x86_64.whl
39+
40+
41+
----
42+
43+
Release History
44+
===============
45+
46+
SageMaker Distributed Data Parallel 1.4.1 Release Notes
47+
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
48+
1349
*Date: May. 3. 2022*
1450

1551
**Currency Updates**
@@ -18,7 +54,9 @@ SageMaker Distributed Data Parallel 1.4.1 Release Notes
1854

1955
**Known Issues**
2056

21-
* The library currently does not support the PyTorch sub-process groups API (torch.distributed.new_group (https://pytorch.org/docs/stable/distributed.html#torch.distributed.new_group)).
57+
* The library currently does not support the PyTorch sub-process groups API
58+
(`torch.distributed.new_group
59+
<https://pytorch.org/docs/stable/distributed.html#torch.distributed.new_group>`_).
2260

2361

2462
**Migration to AWS Deep Learning Containers**
@@ -38,11 +76,6 @@ Binary file of this version of the library for custom container users:
3876
https://smdataparallel.s3.amazonaws.com/binary/pytorch/1.11.0/cu113/2022-04-14/smdistributed_dataparallel-1.4.1-cp38-cp38-linux_x86_64.whl
3977
4078
41-
----
42-
43-
Release History
44-
===============
45-
4679
SageMaker Distributed Data Parallel 1.4.0 Release Notes
4780
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
4881

doc/api/training/smd_model_parallel_release_notes/smd_model_parallel_change_log.rst

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -41,13 +41,25 @@ This version passed benchmark testing and is migrated to the following AWS Deep
4141
4242
763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.11.0-gpu-py38-cu113-ubuntu20.04-sagemaker
4343
44+
- PyTorch 1.11.0 DLC
45+
46+
.. code::
47+
48+
763104351884.dkr.ecr.<region>.amazonaws.com/pytorch-training:1.12.0-gpu-py38-cu113-ubuntu20.04-sagemaker
49+
4450
Binary file of this version of the library for custom container users:
4551

52+
- For PyTorch 1.11.0
53+
4654
.. code::
4755
4856
https://sagemaker-distributed-model-parallel.s3.us-west-2.amazonaws.com/pytorch-1.11.0/build-artifacts/2022-07-11-19-23/smdistributed_modelparallel-1.10.0-cp38-cp38-linux_x86_64.whl
4957
58+
- For PyTorch 1.12.0
59+
60+
.. code::
5061
62+
https://sagemaker-distributed-model-parallel.s3.us-west-2.amazonaws.com/pytorch-1.12.0/build-artifacts/2022-07-11-19-23/smdistributed_modelparallel-1.10.0-cp38-cp38-linux_x86_64.whl
5163
5264
----
5365

0 commit comments

Comments
 (0)