Skip to content

v3: registry maintenance #1146

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 11 commits into from
Jun 5, 2024
Merged

v3: registry maintenance #1146

merged 11 commits into from
Jun 5, 2024

Conversation

nicktrn
Copy link
Collaborator

@nicktrn nicktrn commented Jun 5, 2024

This will allow us to set a failover registry to use during maintenance. Deploys will use the failover, checkpoints will be retried until maintenance completes. If checkpoints fail even after backoff, the runs should still resume normally, but containers will stay up until they do.

Other changes:

  • Improved logging of socket.io handler errors on webapp side
  • Handle remaining coordinator errors, don't just log
  • Increase cleanup IPC timeout (the only change that requires package release)

Copy link

changeset-bot bot commented Jun 5, 2024

🦋 Changeset detected

Latest commit: 4b2a41a

The changes in this PR will be included in the next version bump.

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

@nicktrn nicktrn merged commit 8e5ef17 into main Jun 5, 2024
2 checks passed
@nicktrn nicktrn deleted the v3/registry-maintenance branch June 5, 2024 10:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant