Skip to content

QQ: use a dedicated function for queue recovery after Ra system restart. #11809

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 24, 2024

Conversation

kjnilsson
Copy link
Contributor

Previously we used the registered approach where all Ra servers that have a registered name would be recovered. This could have unintended side effects for queues that e.g. were deleted when not all members of a quorum queueu were running when the queue was deleted. In this case the Ra system would have recovered the members that were not deleted which is not ideal as a dangling member would just sit and loop in pre vote state and a future declaration of the queue may partially fail.

Instead we rely on the meta data store for the truth about which members should be restarted after a ra system restart.

@kjnilsson kjnilsson force-pushed the qq-system-recovery branch 2 times, most recently from bdf1079 to df22e17 Compare July 24, 2024 10:21
Previously we used the `registered` approach where all Ra servers that
have a registered name would be recovered. This could have unintended
side effects for queues that e.g. were deleted when not all members of
a quorum queueu were running when the queue was deleted. In this case
the Ra system would have recovered the members that were not deleted
which is not ideal as a dangling member would just sit and loop in
pre vote state and a future declaration of the queue may partially
fail.

Instead we rely on the meta data store for the truth about which
members should be restarted after a ra system restart.
@kjnilsson kjnilsson force-pushed the qq-system-recovery branch from df22e17 to 4863bc3 Compare July 24, 2024 13:24
@kjnilsson kjnilsson merged commit 1a9da90 into main Jul 24, 2024
191 checks passed
@kjnilsson kjnilsson deleted the qq-system-recovery branch July 24, 2024 15:14
michaelklishin added a commit that referenced this pull request Jul 24, 2024
QQ: use a dedicated function for queue recovery after Ra system restart. (backport #11809)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants