Support collecting hardware performance counters on Windows #885

wesleywiser · 2021-06-18T19:08:19Z

Collect the InstructionRetired and TotalCycles hardware performance counters on Windows using a combination of xperf and tracelog. After collection, we process the resulting trace file to calculate the totals for the performance counters and store those in the appropriate parts of the database for the run. The site is able to display the results without modification and we also have self-profiling data available to site users.

Note: the cpu-clock perf event is not an actual hardware event and no corresponding event is available out of the box on Windows. I believe we can calculate an equivalent value from the trace data but that is left as a future TODO item.

Part of #834

collector/src/execute.rs

collector/README.md

Mark-Simulacrum · 2021-06-18T19:20:15Z

Haven't reviewed too closely but at a high level this seems reasonable to me, will review closely later.

wesleywiser · 2021-06-18T19:37:26Z

Thanks @Mark-Simulacrum!

rylev

♥ Great!

collector/README.md

rylev · 2021-06-24T08:40:49Z

collector/src/etw_parser.rs

+
+    anyhow::ensure!(line.starts_with("OS Version"), "OS version line not found");
+
+    let components: Vec<_> = line.split(',').collect();


Nit: is allocating a vec here necessary?

Probably not but I didn't want to resplit the string twice. I'm not sure which is more expensive...

collector/src/etw_parser.rs

rylev · 2021-06-24T08:46:53Z

collector/src/etw_parser.rs

+        Self {
+            instructions_retired: 0,
+            total_cycles: 0,
+            // FIXME(wesleywiser): We should be properly calculating this value by taking the total time


Should we create issues in the repo for these FIXMEs?

That seems reasonable to me! I'll create issues when this merges and link to the FIXME lines.

rylev · 2021-06-24T08:49:10Z

collector/src/execute.rs

@@ -182,6 +185,7 @@ impl Profiler {
            // is rejected because it can't be used with the `profiler`
            // subcommand. (It's used with `bench_local` instead.)
            "perf-stat" => Err(anyhow!("'perf-stat' cannot be used as the profiler")),
+            "xperf-stat" => Err(anyhow!("'xperf-stat' cannot be used as the profiler")),


Should we explain why these profilers can't be used in the error message?

I'm not entirely sure why this isn't allowed. I think technically it should work but the results will just be the statistics captured during the run which is probably not very useful for profiling (vs benchmarking).

wesleywiser · 2021-06-25T21:05:30Z

Thanks for the review @rylev!

michaelwoerister · 2021-06-29T15:51:03Z

collector/src/rustc-fake.rs

+                let mut cmd = Command::new(tracelog);
+                assert!(cmd.output().is_ok(), "tracelog.exe could not be started");
+
+                cmd.args(&["-start", "counters", "-f", "counters.etl", "-eflag", "CSWITCH+PROC_THREAD+LOADER", "-PMC", "InstructionRetired,TotalCycles:CSWITCH"]);


Would you mind adding documentation somewhere on how this all works? I.e. what the workflow is, which programs are called in which order, what the flags mean? A link to a good description would be fine too.

From playing around with this it looks like the "counters" parameter a globally visible "LoggerName" used for coordinating between the tracelog and the xperf invocations, right? I think it might be a good idea to generate a more meaningful name here that includes the fact that it belongs to "rustc-perf" and maybe the name of benchmark being run. Something like rustc-perf-{benchmark}-logger.

I don't know if it would also make sense to add some randomly generated ID to it too. At least I had some problems where I had to run xperf -stop counters manually because otherwise tracelog would complain that counters already exists. I think my system got into that state because there was some crash after tracelog allocated the logger with name "counters" and the xperf command stopping the logger was never executed. Making that more robust one way or other seems like a good idea.

Added some comments here explaining what the workflow for collecting the counters is.

I also added code to issue a xperf -stop before we do our current run. I think that's better than generating a different name each time as I've read there is a limit to how many concurrent collections you can have running and, if we know the collection name, we can just try stopping it. I also updated the name to "rustc-perf-counters"

Mark-Simulacrum · 2021-07-03T20:40:12Z

Happy to merge this whenever, looks like there's a few unresolved comments left though -- just let me know.

Also fake the cpu-clock counter because it's necessary to get the self-profiling data to display in the site.

Also update the collection name so it's obvious that it is related to the rustc-perf tool.

wesleywiser · 2021-07-08T15:18:12Z

This is ready to merge! I'll leave it open for now in case @Mark-Simulacrum wants to take another look but if not, I'll merge later today.

wesleywiser requested a review from Mark-Simulacrum June 18, 2021 19:08

Mark-Simulacrum reviewed Jun 18, 2021

View reviewed changes

collector/src/execute.rs Outdated Show resolved Hide resolved

Mark-Simulacrum reviewed Jun 18, 2021

View reviewed changes

collector/README.md Outdated Show resolved Hide resolved

wesleywiser mentioned this pull request Jun 18, 2021

Run perf on Windows #834

Open

6 tasks

wesleywiser force-pushed the hardware_counters_windows branch from 6d3719f to 4c4a5ee Compare June 18, 2021 21:09

rylev approved these changes Jun 24, 2021

View reviewed changes

michaelwoerister reviewed Jun 29, 2021

View reviewed changes

wesleywiser added 13 commits July 7, 2021 15:36

Record hardware performance counters on Windows when benchmarking

0fdf9bf

Add support for capturing self-profile data during xperf

357b9be

Also fake the cpu-clock counter because it's necessary to get the self-profiling data to display in the site.

Collect wall-time on Windows

e5b98eb

Include counters for rustc subprocesses in totals

c0cc729

Refactor event processing

7e54451

Remove unnecessary unwrap()s

246ad8a

Add some docs for running on Windows

371f950

Respond to review feedback

49cf501

Fix failure to process ETW events if CSwitch events happen after P-End

b7a5286

Stop any previously running collection before starting the current one

ae26991

Also update the collection name so it's obvious that it is related to the rustc-perf tool.

Add comment explaining how the PMC collection works

cbdc6e1

Add logging to parse_events()

a7e995a

Run rustfmt

5384e7d

wesleywiser force-pushed the hardware_counters_windows branch from 38bec7b to 5384e7d Compare July 7, 2021 19:52

wesleywiser merged commit c648f39 into rust-lang:master Jul 8, 2021

wesleywiser deleted the hardware_counters_windows branch July 8, 2021 19:01


		anyhow::ensure!(line.starts_with("OS Version"), "OS version line not found");

		let components: Vec<_> = line.split(',').collect();

Support collecting hardware performance counters on Windows #885

Support collecting hardware performance counters on Windows #885

Uh oh!

Conversation

wesleywiser commented Jun 18, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Mark-Simulacrum commented Jun 18, 2021

Uh oh!

wesleywiser commented Jun 18, 2021

Uh oh!

rylev left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wesleywiser commented Jun 25, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wesleywiser Jul 7, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Mark-Simulacrum commented Jul 3, 2021

Uh oh!

wesleywiser commented Jul 8, 2021

Uh oh!

Uh oh!

wesleywiser commented Jun 18, 2021 •

edited

Loading

wesleywiser Jul 7, 2021 •

edited

Loading