GODRIVER-2114 Fix failing KMS TLS tests #712

benjirewis · 2021-08-03T17:57:24Z

Uses the new kms_http_server.py instead of the now-removed, trivial mock_kms.js.

benjirewis · 2021-08-03T17:58:28Z

.evergreen/config.yml

-          EOF
-          mongo --nodb mock_kms.js
+          . ./activate_venv.sh
+    - command: shell.exec


Much like the mock OCSP functions, the first command sets up the local environment in the foreground, and the second command starts the Python mock server in the background. These need to be separated for the tests to consistently find the mock KMS server.

Interesting. https://github.com/evergreen-ci/evergreen/wiki/Project-Commands#shellexec notes:

background: if set to true, does not wait for the script to exit before running the next commands

My new hypothesis for the cause of the connection refused errors:

Evergreen would run start-kms-mock-server and proceed before the script completed.

The Go driver tests started before the mock KMS server started.

Starting the virtual environment in a non-background command before helps. But I think this is still hiding a race.

If the mock KMS server does not establish listening sockets before the Go driver tests run, I suspect the same issue will occur. But, given that the OCSP tasks have a similar setup, I bet the likelihood of the KMS server not starting before the Go tests run is slim to none. If we see it failing in the future, we could consider appending a foreground command to loop until it can establish a connection on port 8000. That seems unnecessary for now.

That sounds exactly right. I think all the current mock servers in testing (KMS, OCSP and maybe load balancer?) have this racey behavior. It seems that if you only have the server-starting call in the background function, the tests pretty much never start before the server. So, if we start to see failures we can consider something like a foreground loop.

mongo/integration/client_side_encryption_prose_test.go

kevinAlbs

LGTM! Will this need to be cherry-picked on to the 1.7 branch to have tests passing on that branch? If so, can you create a ticket to track this change (description can be brief). That will tie those commits together.

kevinAlbs · 2021-08-04T19:06:29Z

.evergreen/config.yml

@@ -827,20 +827,18 @@ functions:

  start-kms-mock-server:
    - command: shell.exec
-      type: test


Removing type: test seems right here. The default command_type on L13 is setup. If this task fails it will indicate a setup failure, rather than a test failure (https://github.com/evergreen-ci/evergreen/wiki/Project-Configuration-Files#command-failure-colors)

Yeah setup definitely seems like the right type; not sure why I had test before.

kevinAlbs · 2021-08-04T19:47:53Z

.evergreen/config.yml

-          EOF
-          mongo --nodb mock_kms.js
+          . ./activate_venv.sh
+    - command: shell.exec


Interesting. https://github.com/evergreen-ci/evergreen/wiki/Project-Commands#shellexec notes:

background: if set to true, does not wait for the script to exit before running the next commands

My new hypothesis for the cause of the connection refused errors:

Evergreen would run start-kms-mock-server and proceed before the script completed.

The Go driver tests started before the mock KMS server started.

Starting the virtual environment in a non-background command before helps. But I think this is still hiding a race.

If the mock KMS server does not establish listening sockets before the Go driver tests run, I suspect the same issue will occur. But, given that the OCSP tasks have a similar setup, I bet the likelihood of the KMS server not starting before the Go tests run is slim to none. If we see it failing in the future, we could consider appending a foreground command to loop until it can establish a connection on port 8000. That seems unnecessary for now.

benjirewis

Filed GODRIVER-2114. Let's backport to both release/1.7 and release/1.6 since they both have KMS TLS tests and corresponding Evergreen waterfall tasks.

benjirewis · 2021-08-04T20:17:46Z

.evergreen/config.yml

@@ -827,20 +827,18 @@ functions:

  start-kms-mock-server:
    - command: shell.exec
-      type: test


Yeah setup definitely seems like the right type; not sure why I had test before.

benjirewis · 2021-08-04T20:21:26Z

.evergreen/config.yml

-          EOF
-          mongo --nodb mock_kms.js
+          . ./activate_venv.sh
+    - command: shell.exec


That sounds exactly right. I think all the current mock servers in testing (KMS, OCSP and maybe load balancer?) have this racey behavior. It seems that if you only have the server-starting call in the background function, the tests pretty much never start before the server. So, if we start to see failures we can consider something like a foreground loop.

Benjamin Rewis added 3 commits August 2, 2021 13:21

Initial fix

2035d75

Updates with logging

5feb780

Start separate servers again

95e7fc5

benjirewis commented Aug 3, 2021

View reviewed changes

benjirewis requested a review from kevinAlbs August 3, 2021 17:59

benjirewis marked this pull request as ready for review August 3, 2021 17:59

kevinAlbs approved these changes Aug 4, 2021

View reviewed changes

benjirewis changed the title ~~Fix failing KMS TLS tests~~ GODRIVER-2114 Fix failing KMS TLS tests Aug 4, 2021

benjirewis commented Aug 4, 2021

View reviewed changes

benjirewis merged commit e78e29b into mongodb:master Aug 4, 2021

benjirewis deleted the kmsTlsFix branch August 4, 2021 20:29

benjirewis added a commit that referenced this pull request Aug 4, 2021

GODRIVER-2114 Fix failing KMS TLS tests (#712)

82d3467

benjirewis added a commit that referenced this pull request Aug 4, 2021

GODRIVER-2114 Fix failing KMS TLS tests (#712)

f56ad8e

faem pushed a commit to kubedb/mongo-go-driver that referenced this pull request Mar 17, 2022

GODRIVER-2114 Fix failing KMS TLS tests (mongodb#712)

be4f3fd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GODRIVER-2114 Fix failing KMS TLS tests #712

GODRIVER-2114 Fix failing KMS TLS tests #712

Uh oh!

benjirewis commented Aug 3, 2021

Uh oh!

benjirewis Aug 3, 2021

Uh oh!

kevinAlbs Aug 4, 2021

Uh oh!

benjirewis Aug 4, 2021

Uh oh!

Uh oh!

kevinAlbs left a comment

Uh oh!

kevinAlbs Aug 4, 2021

Uh oh!

benjirewis Aug 4, 2021

Uh oh!

kevinAlbs Aug 4, 2021

Uh oh!

benjirewis left a comment

Uh oh!

benjirewis Aug 4, 2021

Uh oh!

benjirewis Aug 4, 2021

Uh oh!

Uh oh!

GODRIVER-2114 Fix failing KMS TLS tests #712

GODRIVER-2114 Fix failing KMS TLS tests #712

Uh oh!

Conversation

benjirewis commented Aug 3, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kevinAlbs left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benjirewis left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!