Skip to content

Commit 2b71ab4

Browse files
beniericCaptainianikhil-skNikhil Kulkarnijessicazhu3
authored
Master benchmark feature (#4672)
* fix: mainline alt config parsing (#4602) * fix: parsing * fix: commit tests * fix: types * updated * fix * Add Triton v24.03 URI (#4605) Co-authored-by: Nikhil Kulkarni <[email protected]> * feature: support session tag chaining for training job (#4596) * feature: support session tag chaining for training job * fix: resolve typo * fix: resolve typo and build failure * fix: resolve typo and unit test failure --------- Co-authored-by: Jessica Zhu <jessicazhu3@[email protected]> * prepare release v2.217.0 * update development version to v2.217.1.dev0 * fix: properly close files in lineage queries and tests (#4587) Closes #4458 * feature: set default allow_pickle param to False (#4557) * breaking: set default allow_pickle param to False * breaking: fix unit tests and linting NumpyDeserializer will not allow deserialization unless allow_pickle flag is set to True explicitly * fix: black-check --------- Co-authored-by: Ashwin Krishna <[email protected]> * Fix:invalid component error with new metadata (#4634) * fix: invalid component name * tests * format * fix vulnerable model integ tests llama 2 * updated * fix: training dataset location * prepare release v2.218.0 * update development version to v2.218.1.dev0 * chore: update skipped flaky tests (#4644) * Update skipped flaky tests * flake8 * format * format * chore: release tgi 2.0.1 (#4642) * chore: release tgi 2.0.1 * minor fix --------- Co-authored-by: Zhaoqi <[email protected]> * fix: Fix UserAgent logging in Python SDK (#4647) * prepare release v2.218.1 * update development version to v2.218.2.dev0 * feature: allow choosing js payload by alias in private method * Updates for SMP v2.3.1 (#4660) Co-authored-by: Suhit Kodgule <[email protected]> * chore(deps): bump jinja2 from 3.1.3 to 3.1.4 in /doc (#4655) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](pallets/jinja@3.1.3...3.1.4) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * chore(deps): bump tqdm from 4.66.2 to 4.66.3 in /tests/data/serve_resources/mlflow/pytorch (#4650) Bumps [tqdm](https://github.com/tqdm/tqdm) from 4.66.2 to 4.66.3. - [Release notes](https://github.com/tqdm/tqdm/releases) - [Commits](tqdm/tqdm@v4.66.2...v4.66.3) --- updated-dependencies: - dependency-name: tqdm dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * chore(deps): bump jinja2 from 3.1.3 to 3.1.4 in /requirements/extras (#4654) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](pallets/jinja@3.1.3...3.1.4) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> * prepare release v2.219.0 * update development version to v2.219.1.dev0 * fix: skip flakey tests pending investigation (#4667) * change: update image_uri_configs 05-09-2024 07:17:41 PST * Add tensorflow_serving support for mlflow models and enable lineage tracking for mlflow models (#4662) * Initial commit for tensorflow_serving support of MLflow * Add integ tests for mlflow tf_serving * fix style issues * remove unused attributes from tf builder * Add deep ping for tf_serving local mode * Initial commit for lineage impl * Initial commit for tensorflow_serving support of MLflow * Add integ tests for mlflow tf_serving * fix style issues * remove unused attributes from tf builder * Add deep ping for tf_serving local mode * Add integ tests and uts * fix local mode for tf_serving * Allow lineage tracking only in sagemaker endpoint mode * fix regex pattern * fix style issues * fix regex pattern and hard coded py version in ut * fix missing session * Resolve pr comments and fix regex for mlflow registry and ids * fix: model builder race condition on sagemaker session (#4673) Co-authored-by: Jonathan Makunga <[email protected]> * feat: Add telemetry support for mlflow models (#4674) * Initial commit for telemetry support * Fix style issues and add more logger messages * fix value error messages in ut * feat: add new images for HF TGI release (#4677) * chore: add new images for HF TGI release * test * feature: AutoGluon 1.1.0 image_uris update (#4679) Co-authored-by: Ubuntu <[email protected]> * change: add debug logs to workflow container dist creation (#4682) * prepare release v2.220.0 * update development version to v2.220.1.dev0 * fix: Image URI should take precedence for HF models (#4684) * Fix: Image URI should take precedence for HF models * Fix formatting * Fix formatting * Fix formatting * Increase coverage - UT pass * feat: support config_name in all JumpStart interfaces (#4583) (#4607) * add-config-name * address comments * updates for set config * docstyle * updates * fix * format * format * remove tests * Add ReadOnly APIs (#4606) * Add ReadOnly APIs * Resolving PR review comments * Resolve PR review comments * Refactoring * Refactoring * Add Caching * Refactore * Resolving conflicts * Add Unit Tests * Fix Unit Tests * Fix unit tests * Fix UT * Refactoring * Fix Integ tests * refactoring after Notebook testing * Fix code styles --------- Co-authored-by: Jonathan Makunga <[email protected]> * feat: tag JumpStart resource with config names (#4608) * tag config name * format * resolving comments * format * format * update * fix * format * updates inference component config name * fix: tests * ModelBuilder: Add functionalities to get and set deployment config. (#4614) * Add funtionalities to get and set deployment config * Resolve PR comments * ModelBuilder-JS * Add Unit tests * Refactoring * Testing with Notebook * Test backward compatibility * Remove Accelerated column if all not enabled * Fix docstring * Resolved PR Review comments * Docstring * increase code coverage --------- Co-authored-by: Jonathan Makunga <[email protected]> * Benchmark feature v2 (#4618) * Add funtionalities to get and set deployment config * Resolve PR comments * ModelBuilder-JS * Add Unit tests * Refactoring * Testing with Notebook * Test backward compatibility * Remove Accelerated column if all not enabled * Fix docstring * Resolved PR Review comments * Docstring * increase code coverage * Testing fix with Notebook * Only fetch instance rate metrics if not present * Increase code coverage --------- Co-authored-by: Jonathan Makunga <[email protected]> * fix: populate default config name to model (#4617) * fix: populate default config name to model * update condition * fix * format * flake8 * fix tests * fix coverage * temporarily skip integ test vulnerbility * fix tolerate attach method * format * fix predictor * format * Fix fetch instance rate bug (#4624) Co-authored-by: Jonathan Makunga <[email protected]> * chore: require config name and instance type in set_deployment_config (#4625) * require config_name and instance_type in set config * docstring * add supported instance types check * add more tests * format * fix tests * Deployment Configs - Follow-ups (#4626) * Init Deployment configs outside Model init. * Testing with NB * Testing with NB-V2 * Refactoring, NB testing * NB Testing and Refactoring * Testing * Refactoring * Testing with NB * Debug * Debug display API * Debug with NB * Testing with NB * Refactoring * Refactoring * Refactoring and NB testing * Testing with NB * Refactoring * Prefix instance type with ml * Fix unit tests --------- Co-authored-by: Jonathan Makunga <[email protected]> * fix: use different separator to flatten dict (#4629) * Use separate tags for inference and training configs (#4635) * Use separate tags for inference and training * format * format * format * format * Add supported inference and incremental training configs (#4637) * supported inference configs * add tests * format * tests * tests * address comments * format and address comments * updates * formt * format * Benchmark feature fixes (#4632) * Filter down Benchmark Metrics * Filter down Benchmark Metrics * Testing NB * Testing MB * Testing * Refactoring * Unit tests * Display instance type first, and instance rate last * Display unbalanced metrics * Testing with NB * Testing with NB * Debug * Debug * Testing with NB * Testing with NB * Testing with NB * Refactoring * Refactoring * Refactoring * Unit tests * Custom lru * Custom lru * Custom lru * Custom lru * Custom lru * Custom lru * Custom lru * Custom lru * Custom lru * Custom lru * Refactoring * Debug * Config ranking * Debug * Debug * Debug * Debug * Debug * Ranking * Ranking-Debug * Ranking-Debug * Ranking-Debug * Ranking-Debug * Ranking-Debug * Ranking-Debug * Debug * Debug * Debug * Debug * Refactoring * Contact JumpStart team to fix flaky test. test_list_jumpstart_models_script_filter --------- Co-authored-by: Jonathan Makunga <[email protected]> * fix: typo and merge with master branch (#4649) * Merge master into benchmark feature (#4652) * Merge master into master-benchmark-feature (#4656) * Master benchmark feature (#4658) * Remove duplicate line in types.py * Remove duplicate lines * Remove duplicate lines * Remove duplicate lines * Remove duplicate lines * fix unit test --------- Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: Haotian An <[email protected]> Co-authored-by: Nikhil Kulkarni <[email protected]> Co-authored-by: Nikhil Kulkarni <[email protected]> Co-authored-by: jessicazhu3 <[email protected]> Co-authored-by: Jessica Zhu <jessicazhu3@[email protected]> Co-authored-by: ci <ci> Co-authored-by: Justin <[email protected]> Co-authored-by: ASHWIN KRISHNA <[email protected]> Co-authored-by: Ashwin Krishna <[email protected]> Co-authored-by: Haixin Wang <[email protected]> Co-authored-by: Zhaoqi <[email protected]> Co-authored-by: Kalyani Nikure <[email protected]> Co-authored-by: Keerthan Vasist <[email protected]> Co-authored-by: SuhitK <[email protected]> Co-authored-by: Suhit Kodgule <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: sagemaker-bot <[email protected]> Co-authored-by: jiapinw <[email protected]> Co-authored-by: Jonathan Makunga <[email protected]> Co-authored-by: Jonathan Makunga <[email protected]> Co-authored-by: Prateek M Desai <[email protected]> Co-authored-by: Ubuntu <[email protected]> Co-authored-by: Mufaddal Rohawala <[email protected]> Co-authored-by: Samrudhi Sharma <[email protected]> Co-authored-by: evakravi <[email protected]>
1 parent ad30b85 commit 2b71ab4

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

53 files changed

+3930
-1138
lines changed

CHANGELOG.md

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,34 @@
11
# Changelog
22

3+
## v2.220.0 (2024-05-15)
4+
5+
### Features
6+
7+
* AutoGluon 1.1.0 image_uris update
8+
* add new images for HF TGI release
9+
* Add telemetry support for mlflow models
10+
11+
### Bug Fixes and Other Changes
12+
13+
* add debug logs to workflow container dist creation
14+
* model builder race condition on sagemaker session
15+
* Add tensorflow_serving support for mlflow models and enable lineage tracking for mlflow models
16+
* update image_uri_configs 05-09-2024 07:17:41 PST
17+
* skip flakey tests pending investigation
18+
19+
## v2.219.0 (2024-05-08)
20+
21+
### Features
22+
23+
* allow choosing js payload by alias in private method
24+
25+
### Bug Fixes and Other Changes
26+
27+
* chore(deps): bump jinja2 from 3.1.3 to 3.1.4 in /requirements/extras
28+
* chore(deps): bump tqdm from 4.66.2 to 4.66.3 in /tests/data/serve_resources/mlflow/pytorch
29+
* chore(deps): bump jinja2 from 3.1.3 to 3.1.4 in /doc
30+
* Updates for SMP v2.3.1
31+
332
## v2.218.1 (2024-05-03)
433

534
### Bug Fixes and Other Changes

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
2.218.2.dev0
1+
2.220.1.dev0

doc/requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,6 @@ sphinx==5.1.1
22
sphinx-rtd-theme==0.5.0
33
docutils==0.15.2
44
packaging==20.9
5-
jinja2==3.1.3
5+
jinja2==3.1.4
66
schema==0.7.5
77
accelerate>=0.24.1,<=0.27.0

requirements/extras/test_requirements.txt

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ attrs>=23.1.0,<24
1818
fabric==2.6.0
1919
requests==2.31.0
2020
sagemaker-experiments==0.1.35
21-
Jinja2==3.1.3
21+
Jinja2==3.1.4
2222
pyvis==0.2.1
2323
pandas>=1.3.5,<1.5
2424
scikit-learn==1.3.0
@@ -36,3 +36,4 @@ onnx>=1.15.0
3636
nbformat>=5.9,<6
3737
accelerate>=0.24.1,<=0.27.0
3838
schema==0.7.5
39+
tensorflow>=2.1,<=2.16

src/sagemaker/fw_utils.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -142,7 +142,6 @@
142142
"2.1.0",
143143
"2.1.2",
144144
"2.2.0",
145-
"2.3.0",
146145
],
147146
}
148147

@@ -170,6 +169,7 @@
170169
"2.1.2",
171170
"2.2.0",
172171
"2.3.0",
172+
"2.3.1",
173173
]
174174

175175
TRAINIUM_SUPPORTED_DISTRIBUTION_STRATEGIES = ["torch_distributed"]

src/sagemaker/image_uri_config/autogluon.json

Lines changed: 88 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,8 @@
1111
"0.6": "0.6.2",
1212
"0.7": "0.7.0",
1313
"0.8": "0.8.2",
14-
"1.0": "1.0.0"
14+
"1.0": "1.0.0",
15+
"1.1": "1.1.0"
1516
},
1617
"versions": {
1718
"0.3.1": {
@@ -480,6 +481,47 @@
480481
"py_versions": [
481482
"py310"
482483
]
484+
},
485+
"1.1.0": {
486+
"registries": {
487+
"af-south-1": "626614931356",
488+
"il-central-1": "780543022126",
489+
"ap-east-1": "871362719292",
490+
"ap-northeast-1": "763104351884",
491+
"ap-northeast-2": "763104351884",
492+
"ap-northeast-3": "364406365360",
493+
"ap-south-1": "763104351884",
494+
"ap-southeast-1": "763104351884",
495+
"ap-southeast-2": "763104351884",
496+
"ap-southeast-3": "907027046896",
497+
"ap-southeast-4": "457447274322",
498+
"ca-central-1": "763104351884",
499+
"eu-central-1": "763104351884",
500+
"eu-north-1": "763104351884",
501+
"eu-west-1": "763104351884",
502+
"eu-west-2": "763104351884",
503+
"eu-west-3": "763104351884",
504+
"eu-south-1": "692866216735",
505+
"me-south-1": "217643126080",
506+
"sa-east-1": "763104351884",
507+
"us-east-1": "763104351884",
508+
"us-east-2": "763104351884",
509+
"us-gov-east-1": "446045086412",
510+
"us-gov-west-1": "442386744353",
511+
"us-iso-east-1": "886529160074",
512+
"us-isob-east-1": "094389454867",
513+
"us-west-1": "763104351884",
514+
"us-west-2": "763104351884",
515+
"ca-west-1": "204538143572"
516+
},
517+
"repository": "autogluon-training",
518+
"processors": [
519+
"cpu",
520+
"gpu"
521+
],
522+
"py_versions": [
523+
"py310"
524+
]
483525
}
484526
}
485527
},
@@ -491,7 +533,8 @@
491533
"0.6": "0.6.2",
492534
"0.7": "0.7.0",
493535
"0.8": "0.8.2",
494-
"1.0": "1.0.0"
536+
"1.0": "1.0.0",
537+
"1.1": "1.1.0"
495538
},
496539
"versions": {
497540
"0.3.1": {
@@ -987,6 +1030,49 @@
9871030
"py_versions": [
9881031
"py310"
9891032
]
1033+
},
1034+
"1.1.0": {
1035+
"registries": {
1036+
"af-south-1": "626614931356",
1037+
"il-central-1": "780543022126",
1038+
"ap-east-1": "871362719292",
1039+
"ap-northeast-1": "763104351884",
1040+
"ap-northeast-2": "763104351884",
1041+
"ap-northeast-3": "364406365360",
1042+
"ap-south-1": "763104351884",
1043+
"ap-southeast-1": "763104351884",
1044+
"ap-southeast-2": "763104351884",
1045+
"ap-southeast-3": "907027046896",
1046+
"ap-southeast-4": "457447274322",
1047+
"ca-central-1": "763104351884",
1048+
"cn-north-1": "727897471807",
1049+
"cn-northwest-1": "727897471807",
1050+
"eu-central-1": "763104351884",
1051+
"eu-north-1": "763104351884",
1052+
"eu-west-1": "763104351884",
1053+
"eu-west-2": "763104351884",
1054+
"eu-west-3": "763104351884",
1055+
"eu-south-1": "692866216735",
1056+
"me-south-1": "217643126080",
1057+
"sa-east-1": "763104351884",
1058+
"us-east-1": "763104351884",
1059+
"us-east-2": "763104351884",
1060+
"us-gov-east-1": "446045086412",
1061+
"us-gov-west-1": "442386744353",
1062+
"us-iso-east-1": "886529160074",
1063+
"us-isob-east-1": "094389454867",
1064+
"us-west-1": "763104351884",
1065+
"us-west-2": "763104351884",
1066+
"ca-west-1": "204538143572"
1067+
},
1068+
"repository": "autogluon-inference",
1069+
"processors": [
1070+
"cpu",
1071+
"gpu"
1072+
],
1073+
"py_versions": [
1074+
"py310"
1075+
]
9901076
}
9911077
}
9921078
}

src/sagemaker/image_uri_config/huggingface-llm-neuronx.json

Lines changed: 30 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44
"inf2"
55
],
66
"version_aliases": {
7-
"0.0": "0.0.16"
7+
"0.0": "0.0.22"
88
},
99
"versions": {
1010
"0.0.16": {
@@ -180,6 +180,35 @@
180180
"container_version": {
181181
"inf2": "ubuntu22.04"
182182
}
183+
},
184+
"0.0.22": {
185+
"py_versions": [
186+
"py310"
187+
],
188+
"registries": {
189+
"ap-northeast-1": "763104351884",
190+
"ap-south-1": "763104351884",
191+
"ap-south-2": "772153158452",
192+
"ap-southeast-1": "763104351884",
193+
"ap-southeast-2": "763104351884",
194+
"ap-southeast-4": "457447274322",
195+
"eu-central-1": "763104351884",
196+
"eu-central-2": "380420809688",
197+
"eu-south-2": "503227376785",
198+
"eu-west-1": "763104351884",
199+
"eu-west-3": "763104351884",
200+
"il-central-1": "780543022126",
201+
"sa-east-1": "763104351884",
202+
"us-east-1": "763104351884",
203+
"us-east-2": "763104351884",
204+
"us-west-2": "763104351884",
205+
"ca-west-1": "204538143572"
206+
},
207+
"tag_prefix": "2.1.2-optimum0.0.22",
208+
"repository": "huggingface-pytorch-tgi-inference",
209+
"container_version": {
210+
"inf2": "ubuntu22.04"
211+
}
183212
}
184213
}
185214
}

src/sagemaker/image_uri_config/huggingface-llm.json

Lines changed: 48 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,7 @@
1212
"1.2": "1.2.0",
1313
"1.3": "1.3.3",
1414
"1.4": "1.4.5",
15-
"2.0": "2.0.1"
15+
"2.0": "2.0.2"
1616
},
1717
"versions": {
1818
"0.6.0": {
@@ -625,6 +625,53 @@
625625
"container_version": {
626626
"gpu": "cu121-ubuntu22.04"
627627
}
628+
},
629+
"2.0.2": {
630+
"py_versions": [
631+
"py310"
632+
],
633+
"registries": {
634+
"af-south-1": "626614931356",
635+
"il-central-1": "780543022126",
636+
"ap-east-1": "871362719292",
637+
"ap-northeast-1": "763104351884",
638+
"ap-northeast-2": "763104351884",
639+
"ap-northeast-3": "364406365360",
640+
"ap-south-1": "763104351884",
641+
"ap-south-2": "772153158452",
642+
"ap-southeast-1": "763104351884",
643+
"ap-southeast-2": "763104351884",
644+
"ap-southeast-3": "907027046896",
645+
"ap-southeast-4": "457447274322",
646+
"ca-central-1": "763104351884",
647+
"cn-north-1": "727897471807",
648+
"cn-northwest-1": "727897471807",
649+
"eu-central-1": "763104351884",
650+
"eu-central-2": "380420809688",
651+
"eu-north-1": "763104351884",
652+
"eu-west-1": "763104351884",
653+
"eu-west-2": "763104351884",
654+
"eu-west-3": "763104351884",
655+
"eu-south-1": "692866216735",
656+
"eu-south-2": "503227376785",
657+
"me-south-1": "217643126080",
658+
"me-central-1": "914824155844",
659+
"sa-east-1": "763104351884",
660+
"us-east-1": "763104351884",
661+
"us-east-2": "763104351884",
662+
"us-gov-east-1": "446045086412",
663+
"us-gov-west-1": "442386744353",
664+
"us-iso-east-1": "886529160074",
665+
"us-isob-east-1": "094389454867",
666+
"us-west-1": "763104351884",
667+
"us-west-2": "763104351884",
668+
"ca-west-1": "204538143572"
669+
},
670+
"tag_prefix": "2.3.0-tgi2.0.2",
671+
"repository": "huggingface-pytorch-tgi-inference",
672+
"container_version": {
673+
"gpu": "cu121-ubuntu22.04"
674+
}
628675
}
629676
}
630677
}

0 commit comments

Comments
 (0)