Skip to content

Backport test fixes from main to v4.1.x #13377

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 9 commits into from
Feb 20, 2025
Merged

Conversation

dumbbell
Copy link
Collaborator

The fixes come from the following pull requests:

They are backported together to reduce the number of pull requests and the load on CI. Also, CI would likely fail a lot more with one of the fixes missing.

There is still work to do to fix all test flakes, but backporting these will already bring an improvement for the v4.1.x branch.

[Why]
We hit some transient errors with the previous order when doing
mixed-version testing. Swapping the nodes seems to fix the problem.

(cherry picked from commit 5cbda4c)
... are being used at the same time.

[Why]
Depending on which node clusters with which, a node running an older
version of the Khepri Ra machine may not be able to apply Ra commands
and could be stuck.

There is no real solution and this clearly an unsupported scenario. An
old node won't always be able to join a newer cluster.

[How]
In the testsuites, we skip clustering tests if we detect that multiple
Khepri Ra machine versions are being used.

(cherry picked from commit 1f1a135)
[Why]
During mixed-version testing, the old node might not be able to join or
rejoin a cluster if the other nodes run a newer Khepri machine version.

[How]
The old node is used as the cluster seed node and is never touched
otherwise. Other nodes are restarted or join the cluster later.

(cherry picked from commit e76233a)
… with Khepri

[Why]
This test plays with the Mnesia database explicitly.

(cherry picked from commit f088c4f)
[Why]
We see nodes trying to use busy ports in CI from time to time.

(cherry picked from commit e76c227)
... in retry_if_coordinator_unavailable().

(cherry picked from commit ee0b5b5)
This may help debug nodes that try to open busy ports.

(cherry picked from commit a5f30ea)
@dumbbell dumbbell self-assigned this Feb 20, 2025
@mergify mergify bot added the make label Feb 20, 2025
@dumbbell dumbbell marked this pull request as ready for review February 20, 2025 13:02
@dumbbell dumbbell merged commit d60b3a7 into v4.1.x Feb 20, 2025
270 checks passed
@dumbbell dumbbell deleted the backport-test-fixes-from-main branch February 20, 2025 13:02
@dumbbell dumbbell added this to the 4.1.0 milestone Feb 20, 2025
@dumbbell dumbbell changed the title Backport test fixes from main Backport test fixes from main to v4.1.x Feb 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant