Move node draining from actuator into machine controller #174

ingvagabund · 2019-03-07T12:35:31Z

Node draining is a generic operation independent of a specific actuator.
Thus, it makes sense to move the code from actuator into the machine controllers.
The node draining code itself is imported from github.com/openshift/kubernetes-drain.

At the same time, it is currently impossible to use the controller-runtime client for node draining
because it lacks a Patch operation (kubernetes-sigs/controller-runtime#235).
Thus, the machine controller needs to initialize a kube client as well in order to
implement the node draining logic. Once the Patch operation is implemented,
the draining logic can be updated to replace the kube client with the controller-runtime client.

Also, initialize an event recorder to generate a node draining event.

Corresponding openshift upstream PR here: openshift/cluster-api#11

enxebre · 2019-03-08T08:46:18Z

vendor/github.com/openshift/cluster-api/pkg/controller/machine/controller.go

+	NodeNameEnvVar = "NODE_NAME"
+
+	// ExcludeNodeDrainingAnnotation annotation explicitly skips node draining if set
+	ExcludeNodeDrainingAnnotation = "machine.openshift.io/exclude-node-draining"


There was some discussion with @frobware and @kalexand-rh on Slack about reconsidering this annotation name; otherwise looks good to me.

Definitely not a blocker for the PR, as we already have it merged. That said, I am fine with changing the name to something more appropriate.

enxebre · 2019-03-08T09:02:17Z

/approve

openshift-ci-robot · 2019-03-08T09:02:34Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: enxebre

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [enxebre]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

michaelgugino · 2019-03-08T12:42:40Z

vendor/github.com/openshift/cluster-api/pkg/controller/machine/controller.go

+		// deleted without a manual intervention.
+		if _, exists := m.ObjectMeta.Annotations[ExcludeNodeDrainingAnnotation]; !exists && m.Status.NodeRef != nil {
+			if err := func() error {
+				kubeClient, err := kubernetes.NewForConfig(r.config)


Maybe we should add the kubeClient to the reconciler, as we do with the oc client?

I intentionally did not do that, since this is only a temporary solution until controller-runtime implements Patch. Once that happens, the kube client initialization can be dropped completely.
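
To make the trade-off in this thread concrete, here is a small sketch of the two options: building the client inside the delete path on every call (as the PR does with `kubernetes.NewForConfig(r.config)`) versus building it once and storing it on the reconciler (the review suggestion). Everything in it is a hypothetical stand-in: `drainClient`, `clientFactory`, both reconciler types and `countingFactory` exist only for illustration and do not appear in the PR.

```go
package main

import "fmt"

// drainClient is a hypothetical stand-in for the kube client used for draining.
type drainClient struct{}

// clientFactory stands in for a constructor such as kubernetes.NewForConfig.
type clientFactory func() (*drainClient, error)

// perCallReconciler builds a client each time a node is drained, which keeps
// the temporary dependency confined to the delete path.
type perCallReconciler struct{ newClient clientFactory }

func (r *perCallReconciler) drainNode(nodeName string) error {
	if _, err := r.newClient(); err != nil {
		return fmt.Errorf("building client for node %q: %w", nodeName, err)
	}
	// The drain itself would use the freshly built client here.
	return nil
}

// sharedClientReconciler builds the client once and keeps it on the struct.
type sharedClientReconciler struct{ client *drainClient }

func newSharedClientReconciler(newClient clientFactory) (*sharedClientReconciler, error) {
	c, err := newClient()
	if err != nil {
		return nil, fmt.Errorf("building client: %w", err)
	}
	return &sharedClientReconciler{client: c}, nil
}

func (r *sharedClientReconciler) drainNode(nodeName string) error {
	_ = r.client // the drain itself would use the stored client here
	return nil
}

// countingFactory returns a factory that counts how often it is invoked.
func countingFactory(calls *int) clientFactory {
	return func() (*drainClient, error) {
		*calls++
		return &drainClient{}, nil
	}
}

func main() {
	var perCall, shared int
	a := &perCallReconciler{newClient: countingFactory(&perCall)}
	b, err := newSharedClientReconciler(countingFactory(&shared))
	if err != nil {
		panic(err)
	}
	for _, node := range []string{"node-0", "node-1", "node-2"} {
		_ = a.drainNode(node)
		_ = b.drainNode(node)
	}
	fmt.Println("per-call constructions:", perCall, "shared constructions:", shared)
}
```

The per-call variant pays one construction per drained node but leaves the reconciler struct untouched, which fits the stated intent of removing the kube client again once controller-runtime supports Patch.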

michaelgugino · 2019-03-08T12:45:07Z

vendor/github.com/openshift/cluster-api/pkg/controller/machine/controller.go

@@ -145,6 +163,51 @@ func (r *ReconcileMachine) Reconcile(request reconcile.Request) (reconcile.Resul
 			return reconcile.Result{}, nil
 		}
 		klog.Infof("reconciling machine object %v triggers delete.", name)
+
+		// Drain node before deletion


We should think about breaking this large function apart; as it stands, it's going to be very difficult, if not impossible, to test.

There are no unit tests so far other than the ones provided by https://github.com/openshift/kubernetes-drain/. From the functional point of view, we have https://github.com/openshift/cluster-api-actuator-pkg/blob/master/pkg/e2e/actuators/actuators.go#L182.
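
As one possible first step toward the split suggested above, the drain decision could be pulled out of Reconcile into a small pure function and checked with a table of cases. The sketch below assumes a hypothetical `shouldDrain` helper and a simplified `machine` type; neither exists in the PR. It mirrors the condition in the diff, which tests only for the presence of the annotation and not for its value. In a real repository the table would live in a `_test.go` file and use the `testing` package; it is run from `main` here only to keep the example self-contained.

```go
package main

import "fmt"

// Annotation name as it appears in the diff under review.
const excludeNodeDrainingAnnotation = "machine.openshift.io/exclude-node-draining"

// machine is a hypothetical, reduced view of the fields the decision needs.
type machine struct {
	annotations map[string]string
	hasNodeRef  bool
}

// shouldDrain reports whether the node should be drained before deletion.
// Only the presence of the annotation matters, so any value skips draining.
func shouldDrain(m machine) bool {
	_, excluded := m.annotations[excludeNodeDrainingAnnotation]
	return !excluded && m.hasNodeRef
}

func main() {
	cases := []struct {
		name string
		m    machine
		want bool
	}{
		{"node linked, no annotation", machine{hasNodeRef: true}, true},
		{"no node linked", machine{}, false},
		{"annotation with empty value", machine{
			annotations: map[string]string{excludeNodeDrainingAnnotation: ""},
			hasNodeRef:  true,
		}, false},
		{"annotation set to \"false\"", machine{
			annotations: map[string]string{excludeNodeDrainingAnnotation: "false"},
			hasNodeRef:  true,
		}, false},
		{"unrelated annotation", machine{
			annotations: map[string]string{"example.com/other": "x"},
			hasNodeRef:  true,
		}, true},
	}
	for _, tc := range cases {
		got := shouldDrain(tc.m)
		status := "ok"
		if got != tc.want {
			status = "FAIL"
		}
		fmt.Printf("%-32s got=%-5v want=%-5v %s\n", tc.name, got, tc.want, status)
	}
}
```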

michaelgugino

/lgtm

frobware

We should do the annotation name change as a different PR.

test/machines/machines_test.go

Signed-off-by: Vince Prignano <[email protected]>

openshift-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Mar 7, 2019

openshift-ci-robot requested review from bison and frobware March 7, 2019 12:35

ingvagabund mentioned this pull request Mar 7, 2019

UPSTREAM: <carry>: openshift: Machine controller: drain node before machine deletion openshift/cluster-api#11

Merged

enxebre reviewed Mar 8, 2019

openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 8, 2019

michaelgugino reviewed Mar 8, 2019

openshift-ci-robot assigned michaelgugino Mar 8, 2019

michaelgugino approved these changes Mar 8, 2019

openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Mar 8, 2019

frobware requested changes Mar 8, 2019

frobware approved these changes Mar 8, 2019

openshift-merge-robot merged commit 0d58bd9 into openshift:master Mar 8, 2019

michaelgugino pushed a commit to mgugino-upstream-stage/cluster-api-provider-aws that referenced this pull request Feb 12, 2020

Update getting-started.md (openshift#174)

0fde929

Signed-off-by: Vince Prignano <[email protected]>
