Skip to content

Commit 4434f0a

Browse files
author
Dan
authored
Merge branch 'master' into patch-1
2 parents 43bbd84 + 81af40d commit 4434f0a

File tree

129 files changed

+4378
-650
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

129 files changed

+4378
-650
lines changed

CHANGELOG.md

Lines changed: 197 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,202 @@
11
# Changelog
22

3+
## v2.27.0 (2021-03-01)
4+
5+
### Features
6+
7+
* add inference_id to predict
8+
9+
### Bug Fixes and Other Changes
10+
11+
* disable profiler by default for regions not support it
12+
13+
### Documentation Changes
14+
15+
* add TF 2.4.1 support to sm distributed data parallel docs and other updates
16+
17+
## v2.26.0 (2021-02-26)
18+
19+
### Features
20+
21+
* Add Framework Version support for PyTorch compilation (Neo)
22+
23+
### Bug Fixes and Other Changes
24+
25+
* add mxnet 1.7.0 eia configuration
26+
* update source constructor for lineage action and artifact
27+
28+
### Documentation Changes
29+
30+
* fix typo in create_monitoring_schedule method
31+
32+
## v2.25.2 (2021-02-25)
33+
34+
### Bug Fixes and Other Changes
35+
36+
* Use the output path to store the Clarify config file
37+
* feature group should ignore nan values
38+
* ignore failing smdataparallel test
39+
* Add tests for Training job & Transform job in visualizer
40+
* visualizer for pipeline processing job steps
41+
42+
### Documentation Changes
43+
44+
* update doc for Elastic Inference MXNet 1.7.0
45+
46+
## v2.25.1 (2021-02-20)
47+
48+
### Bug Fixes and Other Changes
49+
50+
* Add tests for visualizer to improve test coverage
51+
52+
### Documentation Changes
53+
54+
* specify correct return type
55+
56+
### Testing and Release Infrastructure
57+
58+
* rename canary_quick pytest mark to release
59+
60+
## v2.25.0 (2021-02-19)
61+
62+
### Features
63+
64+
* Enable step caching
65+
* Add other Neo supported regions for Inferentia inference images
66+
67+
### Bug Fixes and Other Changes
68+
69+
* remove FailStep from pipelines
70+
* use sagemaker_session in workflow tests
71+
* use ECR public for multidatamodel tests
72+
* add the mapping from py3 to cuda11 images
73+
* Add 30s cap time for tag tests
74+
* add build spec for slow tests
75+
* mark top 10 slow tests
76+
* remove slow test_run_xxx_monitor_baseline tests
77+
* pin astroid to 2.4.2
78+
79+
### Testing and Release Infrastructure
80+
81+
* unmark more flaky integ tests
82+
* remove canary_quick pytest mark from flaky/unnecessary tests
83+
* remove python3.8 from buildspec
84+
* remove py38 tox env
85+
* fix release buildspec typo
86+
* unblock regional release builds
87+
* lower test TPS for experiment analytics
88+
* move package preparation and publishing to the deploy step
89+
90+
## v2.24.5 (2021-02-12)
91+
92+
### Bug Fixes and Other Changes
93+
94+
* test_tag/test_tags method assert fix in association tests
95+
96+
### Documentation Changes
97+
98+
* removing mention of TF 2.4 from SM distributed model parallel docs
99+
* adding details about mpi options, other small updates
100+
101+
## v2.24.4 (2021-02-09)
102+
103+
### Bug Fixes and Other Changes
104+
105+
* add integration test for listing artifacts by type
106+
* List Associations integ tests
107+
108+
## v2.24.3 (2021-02-04)
109+
110+
### Bug Fixes and Other Changes
111+
112+
* Remove pytest fixture and fix test_tag/s method
113+
114+
## v2.24.2 (2021-02-03)
115+
116+
### Bug Fixes and Other Changes
117+
118+
* use 3.5 version of get-pip.py
119+
* SM DDP release notes/changelog files
120+
121+
### Documentation Changes
122+
123+
* adding versioning to sm distributed data parallel docs
124+
125+
## v2.24.1 (2021-01-28)
126+
127+
### Bug Fixes and Other Changes
128+
129+
* fix collect-tests tox env
130+
* create profiler specific unsupported regions
131+
* Update smd_model_parallel_pytorch.rst
132+
133+
## v2.24.0 (2021-01-22)
134+
135+
### Features
136+
137+
* add support for Std:Join for pipelines
138+
* Map image name to image uri
139+
* friendly names for short URIs
140+
141+
### Bug Fixes and Other Changes
142+
143+
* increase allowed time for search to get updated
144+
* refactor distribution config construction
145+
146+
### Documentation Changes
147+
148+
* Add SMP 1.2.0 API docs
149+
150+
## v2.23.6 (2021-01-20)
151+
152+
### Bug Fixes and Other Changes
153+
154+
* add artifact, action, context to virsualizer
155+
156+
## v2.23.5 (2021-01-18)
157+
158+
### Bug Fixes and Other Changes
159+
160+
* increase time allowed for trial components to index
161+
162+
## v2.23.4.post0 (2021-01-14)
163+
164+
### Documentation Changes
165+
166+
* update predict_fn implementation for PyTorch EIA 1.5.1
167+
168+
## v2.23.4 (2021-01-13)
169+
170+
### Bug Fixes and Other Changes
171+
172+
* remove captureWarninig setting
173+
174+
## v2.23.3 (2021-01-12)
175+
176+
### Bug Fixes and Other Changes
177+
178+
* improve optional dependency error message
179+
* add debugger rule container account in PDT
180+
* assert step execution first in pipeline test
181+
* add service inserted fields to generated Hive DDL
182+
183+
### Documentation Changes
184+
185+
* fix description for max_wait
186+
* use correct classpath in V2 alias documentation.
187+
* Bad arg name in feat-store ingestion manager
188+
189+
## v2.23.2 (2021-01-06)
190+
191+
### Bug Fixes and Other Changes
192+
193+
* remove shell=True in subprocess.check_output
194+
* use SecurityConfig dict key
195+
196+
### Documentation Changes
197+
198+
* remove D212 from ignore to comply with PEP257 standards
199+
3200
## v2.23.1 (2020-12-29)
4201

5202
### Bug Fixes and Other Changes

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
2.23.2.dev0
1+
2.27.1.dev0

buildspec-deploy.yml

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,16 @@ version: 0.2
33
phases:
44
build:
55
commands:
6-
- PACKAGE_FILE="$CODEBUILD_SRC_DIR_ARTIFACT_1/sagemaker-*.tar.gz"
6+
# prepare the release (update versions, changelog etc.)
7+
- git-release --prepare
8+
9+
# generate the distribution package
10+
- python3 setup.py sdist
11+
12+
# publish the release to github
13+
- git-release --publish
14+
15+
- PACKAGE_FILE="dist/sagemaker-*.tar.gz"
716
- PYPI_USER=$(aws secretsmanager get-secret-value --secret-id /codebuild/pypi/user --query SecretString --output text)
817
- PYPI_PASSWORD=$(aws secretsmanager get-secret-value --secret-id /codebuild/pypi/password --query SecretString --output text)
918
- GPG_PRIVATE_KEY=$(aws secretsmanager get-secret-value --secret-id /codebuild/gpg/private_key --query SecretString --output text)

buildspec-localmodetests.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,5 +11,5 @@ phases:
1111

1212
# local mode tests
1313
- start_time=`date +%s`
14-
- execute-command-if-has-matching-changes "tox -e py38 -- tests/integ -m local_mode --durations 50" "tests/integ" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "setup.py" "setup.cfg" "buildspec-localmodetests.yml"
15-
- ./ci-scripts/displaytime.sh 'py38 local mode' $start_time
14+
- execute-command-if-has-matching-changes "tox -e py37 -- tests/integ -m local_mode --durations 50" "tests/integ" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "setup.py" "setup.cfg" "buildspec-localmodetests.yml"
15+
- ./ci-scripts/displaytime.sh 'py37 local mode' $start_time

buildspec-release.yml

Lines changed: 2 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,6 @@ version: 0.2
33
phases:
44
build:
55
commands:
6-
# prepare the release (update versions, changelog etc.)
7-
- git-release --prepare
8-
96
# run linters
107
- tox -e flake8,pylint
118

@@ -18,19 +15,7 @@ phases:
1815
# run unit tests
1916
- AWS_ACCESS_KEY_ID= AWS_SECRET_ACCESS_KEY= AWS_SESSION_TOKEN=
2017
AWS_CONTAINER_CREDENTIALS_RELATIVE_URI= AWS_DEFAULT_REGION=
21-
tox -e py36,py37,py38 -- tests/unit
18+
tox -e py36,py37 -- tests/unit
2219

2320
# run a subset of the integration tests
24-
- IGNORE_COVERAGE=- tox -e py36 -- tests/integ -m canary_quick -n 64 --boxed --reruns 2
25-
26-
# generate the distribution package
27-
- python3 setup.py sdist
28-
29-
# publish the release to github
30-
- git-release --publish
31-
32-
artifacts:
33-
files:
34-
- dist/sagemaker-*.tar.gz
35-
name: ARTIFACT_1
36-
discard-paths: yes
21+
- IGNORE_COVERAGE=- tox -e py36 -- tests/integ -m "not (local_mode or slow_test)" -n 32 --boxed --reruns 2

buildspec-slowtests.yml

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,15 @@
1+
version: 0.2
2+
3+
phases:
4+
pre_build:
5+
commands:
6+
- start-dockerd
7+
8+
build:
9+
commands:
10+
- IGNORE_COVERAGE=-
11+
12+
# slow tests
13+
- start_time=`date +%s`
14+
- execute-command-if-has-matching-changes "tox -e py37 -- tests/integ -m slow_test -n 16 --durations 0" "tests/integ" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "setup.py" "setup.cfg" "buildspec-slowtests.yml"
15+
- ./ci-scripts/displaytime.sh 'py37 slow tests' $start_time

buildspec-unittests.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,5 +18,5 @@ phases:
1818
- start_time=`date +%s`
1919
- AWS_ACCESS_KEY_ID= AWS_SECRET_ACCESS_KEY= AWS_SESSION_TOKEN=
2020
AWS_CONTAINER_CREDENTIALS_RELATIVE_URI= AWS_DEFAULT_REGION=
21-
tox -e py36,py37,py38 --parallel all -- tests/unit
22-
- ./ci-scripts/displaytime.sh 'py36,py37,py38 unit' $start_time
21+
tox -e py36,py37 --parallel all -- tests/unit
22+
- ./ci-scripts/displaytime.sh 'py36,py37 unit' $start_time

buildspec.yml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -11,13 +11,13 @@ phases:
1111

1212
# run integration tests
1313
- start_time=`date +%s`
14-
- execute-command-if-has-matching-changes "python3.8 -u ci-scripts/queue_build.py" "tests/integ" "tests/scripts" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "setup.py" "setup.cfg" "buildspec.yml"
14+
- execute-command-if-has-matching-changes "python3.7 -u ci-scripts/queue_build.py" "tests/integ" "tests/scripts" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "setup.py" "setup.cfg" "buildspec.yml"
1515
- ./ci-scripts/displaytime.sh 'build queue' $start_time
1616

1717
- start_time=`date +%s`
1818
- |
19-
execute-command-if-has-matching-changes "env -u AWS_DEFAULT_REGION tox -e py38 -- tests/integ -m \"not local_mode and not cron\" -n 384 --reruns 3 --reruns-delay 15 --durations 50 --boto-config '{\"region_name\": \"us-east-2\"}'" "tests/integ" "tests/scripts" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "src/sagemaker/image_uri_config/*.json" "setup.py" "setup.cfg" "buildspec.yml"
20-
- ./ci-scripts/displaytime.sh 'py38 tests/integ' $start_time
19+
execute-command-if-has-matching-changes "env -u AWS_DEFAULT_REGION tox -e py37 -- tests/integ -m \"not local_mode and not cron and not slow_test\" -n 384 --reruns 3 --reruns-delay 15 --durations 50 --boto-config '{\"region_name\": \"us-east-2\"}'" "tests/integ" "tests/scripts" "tests/data" "tests/conftest.py" "tests/__init__.py" "src/*.py" "src/sagemaker/image_uri_config/*.json" "setup.py" "setup.cfg" "buildspec.yml"
20+
- ./ci-scripts/displaytime.sh 'py37 tests/integ' $start_time
2121

2222
post_build:
2323
finally:

doc/api/training/smd_data_parallel_pytorch.rst renamed to doc/api/training/sdp_versions/v1.0.0/smd_data_parallel_pytorch.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -155,7 +155,7 @@ PyTorch API
155155

156156
**Supported versions:**
157157

158-
- PyTorch 1.6
158+
- PyTorch 1.6.0
159159

160160

161161
.. function:: smdistributed.dataparallel.torch.distributed.is_available()

doc/api/training/smd_data_parallel_tensorflow.rst renamed to doc/api/training/sdp_versions/v1.0.0/smd_data_parallel_tensorflow.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -414,7 +414,7 @@ TensorFlow API
414414

415415
.. function:: smdistributed.dataparallel.tensorflow.DistributedOptimizer
416416

417-
Applicable if you use the ``tf.estimator`` API in TensorFlow 2.x (2.3).
417+
Applicable if you use the ``tf.estimator`` API in TensorFlow 2.x (2.3.1).
418418
419419
Construct a new ``DistributedOptimizer`` , which uses TensorFlow
420420
optimizer under the hood for computing single-process gradient values
@@ -489,7 +489,7 @@ TensorFlow API
489489

490490
.. function:: smdistributed.dataparallel.tensorflow.BroadcastGlobalVariablesHook
491491

492-
Applicable if you use the ``tf.estimator`` API in TensorFlow 2.x (2.3).
492+
Applicable if you use the ``tf.estimator`` API in TensorFlow 2.x (2.3.1).
493493

494494

495495
``SessionRunHook`` that will broadcast all global variables from root
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
2+
Version 1.0.0 (Latest)
3+
======================
4+
5+
.. toctree::
6+
:maxdepth: 1
7+
8+
v1.0.0/smd_data_parallel_pytorch.rst
9+
v1.0.0/smd_data_parallel_tensorflow.rst

0 commit comments

Comments
 (0)