Add GitpodWorkspaceHighStartFailureRate alert #19099
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Description
This adds an alert which would have triggered for the incident inc-2023-11-20-workspace-start-failures-on-gitpod-cloud.
By having the alert use
for: 15m
we ensure it doesn't get triggered by smaller spikes. Such smaller spikes will be picked up by our SLO error budget instead.This alert uses an existing runbook which we improved to cover this case as well in https://github.com/gitpod-io/runbooks/pull/422
Paired with @WVerlaek
Summary generated by Copilot
🤖[deprecated] Generated by Copilot at 3d1220f
Add a new alert rule for high workspace start failure rate in
workspaces.yaml
. This rule helps to monitor and troubleshoot Gitpod workspaces reliability.Related Issue(s)
Part of follow-ups of inc-2023-11-20-workspace-start-failures-on-gitpod-cloud
How to test
N/A
Documentation
Preview status
gitpod:summary
Build Options
Build
Run the build with werft instead of GHA
Run Leeway with
--dont-test
Publish
Installer
Add desired feature flags to the end of the line above, space separated
Preview Environment / Integration Tests
If enabled this will build
install/preview
If enabled this will create the environment on GCE infra
Saves cost. Untick this only if you're really sure you need a non-preemtible machine.
Valid options are
all
,workspace
,webapp
,ide
,jetbrains
,vscode
,ssh
. If enabled,with-preview
andwith-large-vm
will be enabled.N/A