
operator test tool proposal - add updates from comments #2018


Merged: 5 commits into operator-framework:master on Oct 21, 2019

Conversation

jmccormick2001
Contributor

Description of the change:

added updates to the proposal draft

Motivation for the change:

@openshift-ci-robot openshift-ci-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Oct 7, 2019
@jmccormick2001 jmccormick2001 changed the title add updates from comments operator test tool proposal - add updates from comments Oct 7, 2019
@jmccormick2001
Contributor Author

From the previous PR, there was a question as to whether we have the concept of recommended/optional, which needs some clarification/thought.

I agree. @dmesser can you describe what the difference is between recommended and optional?

As a user, why would I want to run optional, but not recommended, tests?

Do we have a breakdown of which of the current tests fall into these two/three categories?

If not, I think we should propose what we think and circulate with the OLM and CVP teams for feedback.

@jmccormick2001
Contributor Author

Another question, regarding the scope of the --bundle flag:

joelanford (Member), 7 days ago:
IIRC, --config is already used to define the config file used by the scorecard. I think --bundle is good because "bundle" has a well-defined meaning in the context of the Operator Framework, and it's what we're expecting to be passed to this flag.

One caveat/question: @dmesser Are we expecting to test the entire set of bundles that make up a channel/catalog, or are we scoping the scorecard to a single bundle at a time?

@jmccormick2001
Contributor Author

joelanford (Member), 7 days ago:
I think one issue with the scorecard today is that it has too much configuration complexity. The following flags can all be used to define how scorecard runs. Some are optional, some are required, and some are conditionally required. There are also interdependencies between them that can cause confusion.

  --basic-tests                  Enable basic operator checks (default true)
  --cr-manifest strings          Path to manifest for Custom Resource (required) (specify flag multiple times for multiple CRs)
  --crds-dir string              Directory containing CRDs (all CRD manifest filenames must have the suffix 'crd.yaml') (default "deploy/crds")
  --csv-path string              Path to CSV being tested
  --global-manifest string       Path to manifest for Global resources (e.g. CRD manifests)
  --namespace string             Namespace of custom resource created in cluster
  --namespaced-manifest string   Path to manifest for namespaced resources (e.g. RBAC and Operator manifest)
  --olm-deployed                 The OLM has deployed the operator. Use only the CSV for test data
  --olm-tests                    Enable OLM integration checks (default true)

If we go all-in on OLM integration (which I believe is a goal), then I think we should require a bundle and leverage other OLM integrations (e.g. installing OLM and running an operator with OLM) so that we can reduce configuration complexity and make running and troubleshooting scorecard much more straightforward.

@dmesser @shawn-hurley WDYT?
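
For illustration only (the file paths here are made up, and the exact --bundle semantics are still an open question above), the simplification might look roughly like this:

  # today: several interdependent flags
  operator-sdk scorecard \
    --namespace test \
    --cr-manifest deploy/crds/example_cr.yaml \
    --csv-path deploy/olm-catalog/example/example.clusterserviceversion.yaml

  # proposed: a single bundle input
  operator-sdk scorecard --bundle ./bundle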

@jmccormick2001 jmccormick2001 self-assigned this Oct 8, 2019
Contributor

@camilamacedo86 camilamacedo86 left a comment


/lgtm for Me.

@dmesser
Contributor

dmesser commented Oct 8, 2019

How does this tool allow a pipeline to detect a situation where all required tests passed but some recommended tests did not?

@jmccormick2001
Contributor Author

How does this tool allow a pipeline to detect a situation where all required tests passed but some recommended tests did not?

My current thinking was to just report all the tests with their status and let end users iterate through them to determine whether any tests they care about failed. The 'required' tests I treated differently: if they all passed, the exit code would be 0; if any required test failed, the exit code would be non-zero. I would probably keep the 'suite' notion but make it just another label key ('suite=olm', 'suite=basic', 'suite=mysuiteisspecial'), and the text scorecard report would break output out by the 'suite' key, visually much like it works today. Any test other than a 'required' test would be up to the end user to evaluate as they see fit.

Currently, since we don't have labels implemented, I'm treating all the current tests as 'required'; this would change after labels are implemented. We would also need to determine whether the current set of tests are indeed all 'required'; I'm fuzzy on that.

If this notion of making 'suite' just a reserved label key would work, then I could proceed with removing the suite level from the v1alpha2 API as Joe suggested. Let me know if this works and I'll move forward with that change in this PR. @joelanford @dmesser

Member

@joelanford joelanford left a comment


LGTM after fixing typo.

@joelanford
Member

and the text scorecard report would break output out by the 'suite' key, visually much like it works today

@jmccormick2001 I'm not even sure this is necessary. Just to throw something out for discussion, what about something like these?

Non-verbose text

$ operator-sdk scorecard
Spec Block Exists ... pass
Status Block Exists ... fail
	Suggestions:
		- Add a 'status' field to your Custom Resource

Writing into CRs has an effect ... error
	Errors:
		- error getting proxy logs: timeout error

Provided APIs have validation ... pass
Owned CRDs have resources listed ... pass
CRs have at least 1 example ... pass
Spec fields with descriptors ... pass
Status fields with descriptors ... pass

Verbose text

$ operator-sdk scorecard -v
Spec Block Exists
	Status: pass
	Description: "Custom Resource has a Spec Block"
	Labels:
		"necessity": "required"
		"execution": "static"
		"suite": "basic"
Status Block Exists
	Status: fail
	Description: "Custom Resource has a Status Block"
	Labels:
		"necessity": "required"
		"execution": "runtime"
		"suite": "basic"
	Suggestions:
		- Add a 'status' field to your Custom Resource

Writing into CRs has an effect
	Status: error
	Description: "A CR sends PUT/POST requests to the API server to modify resources in response to spec block changes"
	Labels:
		"necessity": "required"
		"execution": "runtime"
		"suite": "basic"
	Errors:
		- error getting proxy logs: timeout error

Provided APIs have validation
	Status: pass
	Description: "All CRDs have an OpenAPI validation subsection"
	Labels:
		"necessity": "required"
		"execution": "static"
		"suite": "olm"
Owned CRDs have resources listed
	Status: pass
	Description: "All Owned CRDs contain a resources subsection"
	Labels:
		"necessity": "required"
		"execution": "runtime"
		"suite": "olm"
CRs have at least 1 example
	Status: pass
	Description: "The CSV's metadata contains an alm-examples section"
	Labels:
		"execution": "static"
		"suite": "olm"
		"necessity": "required"
Spec fields with descriptors
	Status: pass
	Description: "All spec fields have matching descriptors in the CSV"
	Labels:
		"necessity": "required"
		"execution": "static"
		"suite": "olm"
Status fields with descriptors
	Status: pass
	Description: "All status fields have matching descriptors in the CSV"
	Labels:
		"necessity": "required"
		"execution": "runtime"
		"suite": "olm"

JSON

$ operator-sdk scorecard -o json
[
  {
    "name": "Spec Block Exists",
    "status": "pass",
    "description": "Custom Resource has a Spec Block",
    "labels": {
      "execution": "static",
      "necessity": "required",
      "suite": "basic"
    }
  },
  {
    "name": "Status Block Exists",
    "status": "fail",
    "description": "Custom Resource has a Status Block",
    "labels": {
      "execution": "runtime",
      "necessity": "required",
      "suite": "basic"
    },
    "suggestions": [
      "Add a 'status' field to your Custom Resource"
    ]
  },
  {
    "name": "Writing into CRs has an effect",
    "status": "error",
    "description": "A CR sends PUT/POST requests to the API server to modify resources in response to spec block changes",
    "labels": {
      "execution": "runtime",
      "necessity": "required",
      "suite": "basic"
    },
    "errors": [
      "error getting proxy logs: timeout error"
    ]
  },
  {
    "name": "Provided APIs have validation",
    "status": "pass",
    "description": "All CRDs have an OpenAPI validation subsection",
    "labels": {
      "execution": "static",
      "necessity": "required",
      "suite": "olm"
    }
  },
  {
    "name": "Owned CRDs have resources listed",
    "status": "pass",
    "description": "All Owned CRDs contain a resources subsection",
    "labels": {
      "execution": "runtime",
      "necessity": "required",
      "suite": "olm"
    }
  },
  {
    "name": "CRs have at least 1 example",
    "status": "pass",
    "description": "The CSV's metadata contains an alm-examples section",
    "labels": {
      "execution": "static",
      "necessity": "required",
      "suite": "olm"
    }
  },
  {
    "name": "Spec fields with descriptors",
    "status": "pass",
    "description": "All spec fields have matching descriptors in the CSV",
    "labels": {
      "execution": "static",
      "necessity": "required",
      "suite": "olm"
    }
  },
  {
    "name": "Status fields with descriptors",
    "status": "pass",
    "description": "All status fields have matching descriptors in the CSV",
    "labels": {
      "execution": "runtime",
      "necessity": "required",
      "suite": "olm"
    }
  }
]

@jmccormick2001
Contributor Author

@joelanford those look pretty good to me. I'm assuming here the sort order is suite followed by test name.

On a somewhat related note, I'm assuming we want to keep the scorecard configuration file format the same as it exists today: broken out into sections for each plugin, which equates to a suite.

@joelanford
Member

I could be wrong, but I think the order is currently dictated in code based on how we initialize everything. I think we basically just iterate the slice we create, and that ends up being the output order. We could probably continue with that approach.

@dmesser
Contributor

dmesser commented Oct 9, 2019

@jmccormick2001 One use case for this tool is to be run non-interactively in a pipeline. So, in a non-interactive fashion, the pipeline needs to be able to make a call on whether it's fine to let failing recommended tests still constitute a "pass".
One option would be a CLI switch that causes failing recommended and optional tests to fail the run. Another would be to allow introspection of the test results in a machine-readable way, so that something like a pipeline can make an informed decision on whether to proceed.

@joelanford
Member

@jmccormick2001 Re: config file, that's a good question. I have a multi-part answer:

  1. While we're supporting v1alpha1, I think the current format needs to continue to be supported when --version=v1alpha1.
  2. We're planning to remove support for external plugins in v1alpha2 and begin enforcing that a bundle is passed to the scorecard, right? So in the long run, I think the config will be simplified as well.
  3. IMO, once we release support for --version=v1alpha2 and/or change the default to v1alpha2, the output format and config format are set. At that point, I think we would need to bump to v1alpha3 to make a breaking change to either of those.
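
For example, during the transition a user could pin the old behavior explicitly (the --version flag is the one proposed in this PR, not yet final):

  operator-sdk scorecard --version=v1alpha1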

@joelanford
Member

@dmesser The scorecard has (and will continue to have) a JSON output format that is intended to be used by non-interactive pipelines. Our current understanding is that we'll have an appropriate exit code based on the required tests, but that any other pass/fail logic would be handled by the pipeline by introspecting the JSON output. Does that suffice?

@jmrodri
Member

jmrodri commented Oct 9, 2019

/test e2e-aws-ansible

@dmesser
Contributor

dmesser commented Oct 10, 2019

@joelanford I think by itself this would suffice. We need to document examples that show how to use a JSON parser to drive scripting logic. It would just be much easier if there were a way to directly influence when the exit code is non-zero: on failing required tests only, or additionally on failing recommended tests.

@camilamacedo86
Contributor

/test e2e-aws-ansible

@jmccormick2001
Contributor Author

@dmesser @joelanford I wonder about a command flag that would take the types of tests (labels) to gate on: if any test matching those labels failed, the exit code would be non-zero.
Something like --report-fail='required,optional'; with that, if an optional test fails, the exit code would be non-zero.

@joelanford
Member

@jmccormick2001 @dmesser my favorite option for exit code handling is that the exit code always reflects the pass/fail of ALL of the tests that were part of the run (so no special handling for required tests).

If you want to know if some arbitrary set of tests fail (e.g. necessity=required), you would either use label selection to run only that set of tests, or you would run some superset with -o json, and if the pass/fail exit code indicates a failure, you could use jq fu for your own custom exit code logic.

For example, the following jq command would exit with a failure if any required tests fail or error:

jq --exit-status '[.[] | select(.labels.necessity == "required" and (.status == "fail" or .status == "error"))] | length == 0'

As a separate but related note, I think we need at least two separate exit codes.

  1. To handle the case where the scorecard subcommand itself fails for some reason (e.g. couldn't read the kubernetes config, couldn't connect to the apiserver, etc.)
  2. To handle the pass/fail case where a test that was expected to pass did not pass. In this case, the output would be guaranteed to be valid text or JSON.

I like this option because it would require no extra CLI flags and the docs about the exit code would be as simple as:

Scorecard exits with exit code 1 if it encounters an error setting up or executing the tests and 2 if any of the tests result in a "fail" or "error" state.
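
To make that concrete, here is a minimal pipeline sketch, assuming this exit-code scheme is adopted and reusing the jq filter above (an illustration, not a settled interface):

  operator-sdk scorecard -o json > results.json
  rc=$?
  if [ "$rc" -eq 1 ]; then
    echo "scorecard failed to set up or execute the tests"
    exit 1
  elif [ "$rc" -eq 2 ]; then
    # Some test failed or errored; only fail the pipeline when a required test did.
    jq --exit-status '[.[] | select(.labels.necessity == "required" and (.status == "fail" or .status == "error"))] | length == 0' results.json || exit 1
  fi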

@dmesser
Contributor

dmesser commented Oct 11, 2019

@jmccormick2001 - I think that makes sense. A pipeline can simply run the test tool twice:

  1. with the required tests, and bail if the exit code is non-zero.

  2. with the recommended tests, and bail if the exit code is 1, or decide whether or not to bail if the exit code is 2.

To drive better reports, the JSON structure allows proper querying using JSONPath.
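
For example (the -l label selector flag and the 'necessity' values are the ones proposed elsewhere in this thread, so treat the exact spelling as an assumption):

  # 1) Required tests: any non-zero exit stops the pipeline.
  operator-sdk scorecard -l necessity=required || exit 1

  # 2) Recommended tests: stop on a scorecard error (exit 1); a test failure (exit 2) is policy-dependent.
  operator-sdk scorecard -l necessity=recommended -o json > recommended.json
  rc=$?
  if [ "$rc" -eq 1 ]; then exit 1; fi
  # rc == 2 means some recommended test failed; inspect recommended.json and decide whether to proceed.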

@openshift-ci-robot openshift-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Oct 11, 2019
@jmccormick2001
Contributor Author

I added the comments on exit code design and handling to the proposal under the Design Details section to capture it.

Member

@joelanford joelanford left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM after CVP signs off on exit code strategy and my one comment is addressed.

Comment on lines 61 to 65
The exit code of scorecard would be 0 if all required tests passed. The exit code would be non-zero if any of the required tests failed. With this change scores are essentially replaced with a list of pass/fail(s).

A message produced by the scorecard will show whether or not the required
tests fully passed or not. Tests will be labeled in such a way as to
specify them as required or not.
Member

This needs to be rewritten or removed in light of the new Exit Codes section.

Contributor Author

made an update to the PR to remove the exit code details from the Story and instead have them in the new Exit Code detailed design section.

@joelanford
Member

/hold

Until the v0.11.0 release is completed.

@openshift-ci-robot openshift-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 11, 2019
@jmccormick2001
Contributor Author

jmccormick2001 commented Oct 14, 2019

@dmesser @joelanford for the proposal, I wanted to confirm the following to be the key/values on the labels that scorecard would implement:

key = 'necessity'
possible values='required', 'optional', 'recommended'

key = 'suite'
possible values = 'olm', 'basic'

On the scorecard CLI, the flag would be the same as on other kubectl commands:

-l, --selector='': Selector (label query) to filter on; supports '=', '==', and '!=' (e.g. -l key1=value1,key2=value2)
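
For example (using the keys and values above; exact matching behavior to be confirmed in the implementation):

  operator-sdk scorecard -l suite=olm
  operator-sdk scorecard -l necessity=required,suite=basic
  operator-sdk scorecard -l 'necessity!=optional'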

- Announce deprecation and support policy of the existing feature
- Deprecate the feature
- We are adding a new --version flag to allow users to switch between
v1alpha1 and the proposed v1alpha2 or vice-versa for backward compatibility
Member

How long will we keep v1alpha1?

Contributor Author

Maybe only keep it for the next release/fixes (0.12 -> 0.12.x), then remove it in a release after that?

Member

Yeah I think keeping it for 1 release would be enough.

So, for example:

  • 0.11.x has v1alpha1 only
  • 0.12.x has v1alpha1 and v1alpha2
  • 0.13.x has v1alpha2 only

@@ -124,6 +129,12 @@ The “source of truth” for validation would need to be established such as wh

If no runtime tests are specified, then the scorecard would only run the static tests and not depend on a running cluster.

Allow tests to declare a set of labels that can be introspected by the scorecard at runtime.
Member

This might be obvious to everyone, and sorry if it is, but how does a test declare a set of labels?

Contributor Author

Here is my current thinking and what I've been prototyping so far. To add labels to the CheckSpec basic test, I add the labels to the 'TestInfo' struct for that test like so:

    return &CheckSpecTest{
            BasicTestConfig: conf,
            TestInfo: schelpers.TestInfo{
                    Name:        "Spec Block Exists",
                    Description: "Custom Resource has a Spec Block",
                    Cumulative:  false,
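                    // Labels are what the scorecard would match label selectors against at runtime.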
                    Labels:      labels.Set{"required": "true", "suite": "basic"},
            },
    }

Member

For now, they're statically defined in code. In the future, if we support Kubernetes Job-based tests, they would be declared in metadata.labels.

@joelanford
Member

/hold cancel

v0.11.0 is released, so we can proceed with merging when we're ready.

@jmccormick2001 Re: #2018 (comment), that sounds good to me.

@openshift-ci-robot openshift-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 15, 2019
@jmccormick2001
Contributor Author

@dmesser @joelanford @shawn-hurley just a heads-up, I was planning on making all the existing scorecard basic and olm tests 'required' for v1alpha2. Let me know if you think any of them should be labeled other than required.

@camilamacedo86
Contributor

@jmccormick2001
I think it needs a rebase with master to pass CI.

@camilamacedo86 camilamacedo86 added approved Indicates a PR has been approved by an approver from all required OWNERS files. kind/documentation Categorizes issue or PR as related to documentation. labels Oct 21, 2019
@jmccormick2001 jmccormick2001 merged commit e713519 into operator-framework:master Oct 21, 2019