-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Tune (local mode) support for Jumpstart Models #4532
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #4532 +/- ##
==========================================
+ Coverage 87.38% 87.44% +0.06%
==========================================
Files 389 389
Lines 36776 36881 +105
==========================================
+ Hits 32135 32249 +114
+ Misses 4641 4632 -9 ☔ View full report in Codecov by Sentry. |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
Refactoring Refactoring Refactoring
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
AWS CodeBuild CI Report
Powered by github-codebuild-logs, available on the AWS Serverless Application Repository |
Overriding codecov/patch since coverage is affected by logger.info & logger.warning lines |
* JumpStart TGI tune support * TGI does not enable multiple model serving internal to the container * Tune support for JS model with DJL DLC * Pretty Print JS TGI Benchmark results * Pretty Print JS DJL Benchmark results * Debugging * Refactoring * Revert "Refactoring" This reverts commit 4be1679. * refactoring * Revert "Debugging" This reverts commit 1ec29e3. * Add Tests * Refactoring * Refactoring * Debugging * Refactoring and Debugging * Refactoring * Refactoring Refactoring Refactoring Refactoring * Refactoring * Add unit tests * Address PR Review Comments * Sharded is not supported for all model types * Refactoring * Refactoring * Refactoring * Refactoring * Refactoring * Addressed PR Review comments * Sharding support validation for TGI * Fix unit tests * Increase code coverage --------- Co-authored-by: Jonathan Makunga <[email protected]>
* JumpStart TGI tune support * TGI does not enable multiple model serving internal to the container * Tune support for JS model with DJL DLC * Pretty Print JS TGI Benchmark results * Pretty Print JS DJL Benchmark results * Debugging * Refactoring * Revert "Refactoring" This reverts commit 4be1679. * refactoring * Revert "Debugging" This reverts commit 1ec29e3. * Add Tests * Refactoring * Refactoring * Debugging * Refactoring and Debugging * Refactoring * Refactoring Refactoring Refactoring Refactoring * Refactoring * Add unit tests * Address PR Review Comments * Sharded is not supported for all model types * Refactoring * Refactoring * Refactoring * Refactoring * Refactoring * Addressed PR Review comments * Sharding support validation for TGI * Fix unit tests * Increase code coverage --------- Co-authored-by: Jonathan Makunga <[email protected]>
Issue #, if available:
Description of changes:
Currently, ModelBuilder provides tune capability only for HF models. This PR extends tune capability to JumpStart Models as well.
To leverage this feature, users bring in JumpStart model id, and create
model
with ModelBuilder.This feature is only for
Local_Mode
.Testing done:
Merge Checklist
Put an
x
in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.General
Tests
unique_name_from_base
to create resource names in integ tests (if appropriate)By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.