[DOCS] Documents that models with one allocation might have downtime #2567
Hmm, I think this case applies to serverless projects only, doesn't it, @droberts195?
No, it affects all types of installation: on-premise, classic Cloud, and serverless. If a model is running on only one node, it can't service requests while that node is restarted for any reason.
LGTM. 👍
LGTM
…2567) (#2573) Co-authored-by: István Zoltán Szabó <[email protected]>
…2567) (#2574) Co-authored-by: István Zoltán Szabó <[email protected]>
…2567) (#2575) Co-authored-by: István Zoltán Szabó <[email protected]>
Overview
This PR adds an IMPORTANT block to the trained model deployment section. The block explains that users will experience downtime in the service of their trained model if the model has only one allocation deployed.
Preview: Deploy the model to your cluster
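For illustration only, here is a minimal sketch of how a deployment could be started with more than one allocation, so that a single node restart is less likely to interrupt service (assuming the allocations end up on more than one node). It only builds the request for the Elasticsearch start trained model deployment API and sends nothing. The helper name `start_deployment_request`, the base URL, and the model ID `my-model` are hypothetical and not part of this PR.

```python
from urllib.parse import quote, urlencode


def start_deployment_request(
    base_url: str, model_id: str, number_of_allocations: int = 2
) -> tuple[str, str]:
    """Build the HTTP method and URL for starting a trained model deployment.

    Hypothetical helper: it only constructs the request and does not contact
    a cluster. The default of 2 allocations is an illustrative choice.
    """
    if number_of_allocations < 1:
        raise ValueError("number_of_allocations must be at least 1")
    # number_of_allocations is passed as a query parameter of the _start endpoint.
    query = urlencode({"number_of_allocations": number_of_allocations})
    path = f"/_ml/trained_models/{quote(model_id, safe='')}/deployment/_start"
    return "POST", f"{base_url.rstrip('/')}{path}?{query}"


if __name__ == "__main__":
    # Assumed local cluster address and model ID, for demonstration only.
    method, url = start_deployment_request("http://localhost:9200", "my-model")
    print(method, url)
```

With a single allocation, the same request would leave the model served from one node only, which is the case the new IMPORTANT block warns about.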