
[llvm][llvm-lit] Add option to create unique result file names if results already exist #112729


Merged
DavidSpickett merged 11 commits into llvm:main from lit-reports on Oct 21, 2024

Conversation

DavidSpickett
Collaborator

@DavidSpickett DavidSpickett commented Oct 17, 2024

When running a build like:

ninja check-clang check-llvm

Prior to my changes you ended up with one results file, in this specific case JUnit XML:

results.xml

This would only include the last set of tests lit ran, which were for llvm. To get around this, many CI systems will run one check target, move the file away, then run another, somehow propagating the return code as well.

retcode=0
for target in $targets; do
  ninja $target || retcode=$?
  mv results.xml results-${target}.xml
done
<report the overall return code>

I want to use something like this Buildkite reporting plugin in CI, which needs to have all the results available:
https://buildkite.com/docs/agent/v3/cli-annotate#using-annotations-to-report-test-results

Modifying CI's build scripts for Windows and Linux is a lot of work. So my changes instead make lit detect an existing result file and modify the file name until it finds a new file to write to. Now you will get:

results.xml results.<tempfile generated value>.xml

This will work for all result file types since I'm doing it in the base Report class. Now that you've got separate files, it's easy to collect them all with <path>/*.xml.

Note that the <tempfile generated value> is not ordered.
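
For the CI use case, collecting the per-target reports afterwards could then look something like this (a minimal sketch; the "build" and "artifacts" paths are illustrative, and the flag itself would typically reach lit via LIT_OPTS or the runner's argument list, not via this script):

# Minimal sketch: gather every lit result file the build produced.
# "build" and "artifacts" are example paths, not LLVM CI conventions.
import glob
import os
import shutil

os.makedirs("artifacts", exist_ok=True)
for report in glob.glob("build/*.xml"):
    shutil.copy(report, "artifacts")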

@llvmbot
Member

llvmbot commented Oct 17, 2024

@llvm/pr-subscribers-testing-tools

Author: David Spickett (DavidSpickett)

Changes

When running a build like:
ninja check-clang check-llvm

Prior to my changes you ended up with one results file, in this specific case
JUnit XML:
results.xml

This would only include the last set of tests lit ran, which were for llvm.
To get around this, many CI systems will run one check target, move the file
away, then run another.

for target in targets:
  ninja target
  mv results.xml results-${target}.xml

I want to use something like this Buildkite reporting plugin in CI,
which needs to have all the results available:
https://buildkite.com/docs/agent/v3/cli-annotate#using-annotations-to-report-test-results

Modifying CI's build scripts for Windows and Linux is a lot of work.
So my changes instead make lit detect an existing result file and modify the file name
until it finds a unique file name to write to.

Now you will get:
results.xml results.1.xml

This will work for all result file types since I'm doing it in the base Report
class. Now that you've got separate files, it's easy to collect them with <path>/*.xml.

The number will increment as many times as needed until a useable name
is found.
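
As a rough illustration of the refactoring described above (the subclass here is hypothetical, not part of the patch), each report format now only implements _write_results_to_file and inherits the open-and-find-a-unique-name logic from the Report base class:

# Hypothetical example of a new report format written against the
# refactored base class. write_results (and with it the unique file
# name handling) comes from Report; the subclass only formats output.
class PlainTextReport(Report):
    def _write_results_to_file(self, tests, elapsed, file):
        for test in tests:
            file.write("%s: %s\n" % (test.getFullName(), test.result.code.name))
        file.write("# %d tests in %.2fs\n" % (len(tests), elapsed))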


Full diff: https://github.com/llvm/llvm-project/pull/112729.diff

3 Files Affected:

  • (modified) llvm/utils/lit/lit/cl_arguments.py (+20-8)
  • (modified) llvm/utils/lit/lit/reports.py (+53-30)
  • (added) llvm/utils/lit/tests/unique-output-file.py (+28)
diff --git a/llvm/utils/lit/lit/cl_arguments.py b/llvm/utils/lit/lit/cl_arguments.py
index 5ccae4be096796..3b11342dec2162 100644
--- a/llvm/utils/lit/lit/cl_arguments.py
+++ b/llvm/utils/lit/lit/cl_arguments.py
@@ -175,6 +175,13 @@ def parse_args():
         type=lit.reports.TimeTraceReport,
         help="Write Chrome tracing compatible JSON to the specified file",
     )
+    execution_group.add_argument(
+        "--use-unique-output-file-name",
+        help="When enabled, lit will not overwrite existing test report files. "
+        "Instead it will modify the file name until it finds a file name "
+        "that does not already exist. [Default: Off]",
+        action="store_true",
+    )
     execution_group.add_argument(
         "--timeout",
         dest="maxIndividualTestTime",
@@ -332,16 +339,21 @@ def parse_args():
     else:
         opts.shard = None
 
-    opts.reports = filter(
-        None,
-        [
-            opts.output,
-            opts.xunit_xml_output,
-            opts.resultdb_output,
-            opts.time_trace_output,
-        ],
+    opts.reports = list(
+        filter(
+            None,
+            [
+                opts.output,
+                opts.xunit_xml_output,
+                opts.resultdb_output,
+                opts.time_trace_output,
+            ],
+        )
     )
 
+    for report in opts.reports:
+        report.use_unique_output_file_name = opts.use_unique_output_file_name
+
     return opts
 
 
diff --git a/llvm/utils/lit/lit/reports.py b/llvm/utils/lit/lit/reports.py
index d2d719b076bc70..c2a0239e360f37 100755
--- a/llvm/utils/lit/lit/reports.py
+++ b/llvm/utils/lit/lit/reports.py
@@ -1,7 +1,9 @@
+import abc
 import base64
 import datetime
 import itertools
 import json
+import os
 
 from xml.sax.saxutils import quoteattr as quo
 
@@ -14,11 +16,43 @@ def by_suite_and_test_path(test):
     return (test.suite.name, id(test.suite), test.path_in_suite)
 
 
-class JsonReport(object):
+class Report(object):
     def __init__(self, output_file):
         self.output_file = output_file
+        # Set by the option parser later.
+        self.use_unique_output_file_name = False
 
     def write_results(self, tests, elapsed):
+        if self.use_unique_output_file_name:
+            file = None
+            filepath = self.output_file
+            attempt = 0
+            while file is None:
+                try:
+                    file = open(filepath, "x")
+                except FileExistsError:
+                    attempt += 1
+                    # If there is an extension insert before that because most
+                    # glob patterns for these will be '*.extension'. Otherwise
+                    # add to the end of the path.
+                    path, ext = os.path.splitext(self.output_file)
+                    filepath = path + f".{attempt}" + ext
+
+            with file:
+                self._write_results_to_file(tests, elapsed, file)
+        else:
+            # Overwrite if the results already exist.
+            with open(self.output_file, "w") as file:
+                self._write_results_to_file(tests, elapsed, file)
+
+    @abc.abstractmethod
+    def _write_results_to_file(self, tests, elapsed, file):
+        """Write test results to the file object "file"."""
+        pass
+
+
+class JsonReport(Report):
+    def _write_results_to_file(self, tests, elapsed, file):
         unexecuted_codes = {lit.Test.EXCLUDED, lit.Test.SKIPPED}
         tests = [t for t in tests if t.result.code not in unexecuted_codes]
         # Construct the data we will write.
@@ -67,9 +101,8 @@ def write_results(self, tests, elapsed):
 
             tests_data.append(test_data)
 
-        with open(self.output_file, "w") as file:
-            json.dump(data, file, indent=2, sort_keys=True)
-            file.write("\n")
+        json.dump(data, file, indent=2, sort_keys=True)
+        file.write("\n")
 
 
 _invalid_xml_chars_dict = {
@@ -88,21 +121,18 @@ def remove_invalid_xml_chars(s):
     return s.translate(_invalid_xml_chars_dict)
 
 
-class XunitReport(object):
-    def __init__(self, output_file):
-        self.output_file = output_file
-        self.skipped_codes = {lit.Test.EXCLUDED, lit.Test.SKIPPED, lit.Test.UNSUPPORTED}
+class XunitReport(Report):
+    skipped_codes = {lit.Test.EXCLUDED, lit.Test.SKIPPED, lit.Test.UNSUPPORTED}
 
-    def write_results(self, tests, elapsed):
+    def _write_results_to_file(self, tests, elapsed, file):
         tests.sort(key=by_suite_and_test_path)
         tests_by_suite = itertools.groupby(tests, lambda t: t.suite)
 
-        with open(self.output_file, "w") as file:
-            file.write('<?xml version="1.0" encoding="UTF-8"?>\n')
-            file.write('<testsuites time="{time:.2f}">\n'.format(time=elapsed))
-            for suite, test_iter in tests_by_suite:
-                self._write_testsuite(file, suite, list(test_iter))
-            file.write("</testsuites>\n")
+        file.write('<?xml version="1.0" encoding="UTF-8"?>\n')
+        file.write('<testsuites time="{time:.2f}">\n'.format(time=elapsed))
+        for suite, test_iter in tests_by_suite:
+            self._write_testsuite(file, suite, list(test_iter))
+        file.write("</testsuites>\n")
 
     def _write_testsuite(self, file, suite, tests):
         skipped = 0
@@ -206,11 +236,8 @@ def gen_resultdb_test_entry(
     return test_data
 
 
-class ResultDBReport(object):
-    def __init__(self, output_file):
-        self.output_file = output_file
-
-    def write_results(self, tests, elapsed):
+class ResultDBReport(Report):
+    def _write_results_to_file(self, tests, elapsed, file):
         unexecuted_codes = {lit.Test.EXCLUDED, lit.Test.SKIPPED}
         tests = [t for t in tests if t.result.code not in unexecuted_codes]
         data = {}
@@ -249,17 +276,14 @@ def write_results(self, tests, elapsed):
                         )
                     )
 
-        with open(self.output_file, "w") as file:
-            json.dump(data, file, indent=2, sort_keys=True)
-            file.write("\n")
+        json.dump(data, file, indent=2, sort_keys=True)
+        file.write("\n")
 
 
-class TimeTraceReport(object):
-    def __init__(self, output_file):
-        self.output_file = output_file
-        self.skipped_codes = {lit.Test.EXCLUDED, lit.Test.SKIPPED, lit.Test.UNSUPPORTED}
+class TimeTraceReport(Report):
+    skipped_codes = {lit.Test.EXCLUDED, lit.Test.SKIPPED, lit.Test.UNSUPPORTED}
 
-    def write_results(self, tests, elapsed):
+    def _write_results_to_file(self, tests, elapsed, file):
         # Find when first test started so we can make start times relative.
         first_start_time = min([t.result.start for t in tests])
         events = [
@@ -270,8 +294,7 @@ def write_results(self, tests, elapsed):
 
         json_data = {"traceEvents": events}
 
-        with open(self.output_file, "w") as time_trace_file:
-            json.dump(json_data, time_trace_file, indent=2, sort_keys=True)
+        json.dump(json_data, time_trace_file, indent=2, sort_keys=True)
 
     def _get_test_event(self, test, first_start_time):
         test_name = test.getFullName()
diff --git a/llvm/utils/lit/tests/unique-output-file.py b/llvm/utils/lit/tests/unique-output-file.py
new file mode 100644
index 00000000000000..e0ce21aebf6950
--- /dev/null
+++ b/llvm/utils/lit/tests/unique-output-file.py
@@ -0,0 +1,28 @@
+# Check that lit will not overwrite existing result files when given
+# --use-unique-output-file-name.
+
+# Files are overwritten without the option.
+# RUN: rm -rf %t.xunit*.xml
+# RUN: echo "test" > %t.xunit.xml
+# RUN: not %{lit} --xunit-xml-output %t.xunit.xml %{inputs}/xunit-output
+# RUN: FileCheck < %t.xunit.xml %s --check-prefix=NEW
+
+# RUN: rm -rf %t.xunit*.xml
+# RUN: echo "test" > %t.xunit.xml
+# Files should not be overwritten with the option.
+# RUN: not %{lit} --xunit-xml-output %t.xunit.xml --use-unique-output-file-name %{inputs}/xunit-output
+# RUN: FileCheck < %t.xunit.xml %s --check-prefix=EXISTING
+# EXISTING: test
+# Results in a new file with "1" added.
+# RUN: FileCheck < %t.xunit.1.xml %s --check-prefix=NEW
+# NEW:      <?xml version="1.0" encoding="UTF-8"?>
+# NEW-NEXT: <testsuites time="{{[0-9.]+}}">
+# (assuming that other tests check the whole contents of the file)
+
+# The number should increment as many times as needed.
+# RUN: touch %t.xunit.2.xml
+# RUN: touch %t.xunit.3.xml
+# RUN: touch %t.xunit.4.xml
+
+# RUN: not %{lit} --xunit-xml-output %t.xunit.xml --use-unique-output-file-name %{inputs}/xunit-output
+# RUN: FileCheck < %t.xunit.5.xml %s --check-prefix=NEW
\ No newline at end of file

@DavidSpickett
Collaborator Author

A question might be "why can't we just use check-all in CI?". We cannot do that because sometimes in CI we need to disable the tests of a single project while still building that project, or the project may be a dependency of one we do test but its own tests do not work in CI.

@DavidSpickett
Collaborator Author

Right now CI does ninja check-x check-y check-z and so on. This means that the final part of the log is just the results of the final check target, which might have passed even though an earlier one didn't.

This feature will help me fix that by creating an aggregated report from which you can see right away which tests failed across the whole build.
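
As a rough sketch of what that aggregation could look like (assuming lit's JUnit-style XML where failed tests carry a failure child element; the glob path is illustrative and none of this is part of the patch):

# Minimal sketch: print every failing test found across the collected
# xunit files, so a single summary covers the whole multi-target build.
import glob
import xml.etree.ElementTree as ET

for path in glob.glob("build/results*.xml"):
    for case in ET.parse(path).getroot().iter("testcase"):
        if case.find("failure") is not None:
            print("%s: %s :: %s" % (path, case.get("classname"), case.get("name")))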

Collaborator

@pogo59 pogo59 left a comment


Looks reasonable to me; certainly having a proper summary of all the check runs at the end is a good idea. My only caution is that I'm not a Python expert by any means, and it would be nice if someone else could confirm your statement about propagating the flag.

If they don't, ping me next week and I'll approve.

Collaborator

@jh7370 jh7370 left a comment


I assume you expect the area which will contain the results files will be clean before running the tests in the first place? Otherwise you'll end up having stale results files in addition to new results files, and then sweeping them all up will result in spurious data being recorded.

Another thought I had is you could just name the results file after the specific lit suite it comes from (presumably derived from the directory path or similar somehow?). In this way, new results will overwrite old results without cleanup, but should also be uniquely named.

@DavidSpickett
Collaborator Author

DavidSpickett commented Oct 18, 2024

I assume you expect the area which will contain the results files will be clean before running the tests in the first place? Otherwise you'll end up having stale results files in addition to new results files, and then sweeping them all up will result in spurious data being recorded.

Yes. I think it's fair enough to expect that CI pipelines will run a pretty simple rm *.xml or start from a clean dir. This feature saves them from a for loop where they have to propagate the return code each time.

I suppose someone might want to run lit over and over and then compare variance between runs, and this feature allows that, though it's not my use case.

Another thought I had is you could just name the results file after the specific lit suite it comes from (presumably derived from the directory path or similar somehow?). In this way, new results will overwrite old results without cleanup, but should also be uniquely named.

This is what I thought of doing via the shell scripting at first. Looking at lit itself, it can take multiple paths in a single invocation and the test results are the combination of all tests discovered.

So if you had lots of things in the same sub-tree we could factor that out and use that in the filename:

lit /home/david.spickett/a/ /home/david.spickett/b/

Produces:

results.home_david_spickett_a_b.xml

That is, I think, a different feature from the one I'm proposing, but it would work equally well for CI's use case, with the advantage that you can see from the filename which report goes with which project. Though I think only a tiny minority of people would ever take advantage of that.

If we wanted a path based naming feature in the future, I don't think this unique file names feature would clash with it. You'd just get results.some_path_here.<randomletters>.xml.

My concern with it is figuring out how to condense multiple paths into a filename that's useful to the reader.
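
For what it's worth, one purely hypothetical way to condense them (not part of this PR, and it produces a different name than the example above) would be to fold only the common prefix of the given paths into the name:

# Hypothetical sketch of the path-based naming idea: use the common
# prefix of the test paths as part of the report file name.
import os

def suggest_report_name(paths, base="results", ext=".xml"):
    common = os.path.commonpath([os.path.abspath(p) for p in paths])
    tag = common.strip(os.sep).replace(os.sep, "_") or "root"
    return base + "." + tag + ext

# suggest_report_name(["/home/david.spickett/a/", "/home/david.spickett/b/"])
# gives "results.home_david.spickett.xml", which drops the "a"/"b" part -
# exactly the kind of information loss that makes this naming scheme tricky.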

Collaborator

@jh7370 jh7370 left a comment


LGTM. Might want to wait for @pogo59 to confirm he's happy with this approach too.

@pogo59
Collaborator

pogo59 commented Oct 18, 2024

LGTM. Might want to wait for @pogo59 to confirm he's happy with this approach too.

Yes, LGTM

This is to later allow me to handle the choosing of the filename
in one place, so that we can write unique files across multiple
runs.
[llvm][llvm-lit] Add option to create unique result file names if results already exist

When running a build like:
ninja check-clang check-llvm

Prior to my changes you ended up with one results file, in this specific case
JUnit XML:
results.xml

This would only include the last set of tests lit ran, which were for llvm.
To get around this, many CI systems will run one check target, move the file
away, then run another.

for target in targets:
  ninja target
  mv results.xml results-${target}.xml

I want to use something like this Buildkite reporting plugin in CI,
which needs to have all the results available:
https://buildkite.com/docs/agent/v3/cli-annotate#using-annotations-to-report-test-results

Modifying CI's build scripts for Windows and Linux is a lot of work.
So my changes instead make lit detect an existing result file and modify the file name
until it finds a unique file name to write to.

Now you will get:
results.xml results.1.xml

This will work for all result file types since I'm doing it in the base Report
class. Now you've got separate files, it's easy to collect them with `<path>/*.xml`.

The number will increment as many times as needed until a useable name
is found.
@DavidSpickett
Collaborator Author

Thanks for the reviews, this is much nicer than what I first proposed!

@DavidSpickett DavidSpickett merged commit 8507dba into llvm:main Oct 21, 2024
4 of 6 checks passed
@DavidSpickett DavidSpickett deleted the lit-reports branch October 21, 2024 09:43