Improve perf measurements of `build_extern_trait_impl` #90363

camelid · 2021-10-28T03:14:00Z

Before, it was only measuring one callsite of `build_impl`, and it incremented the call count even if `build_impl` returned early because the `did` was already inlined.

Now, it measures all calls, minus calls that return early.
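To illustrate the difference described above, here is a minimal sketch, not the actual rustdoc code. `Context`, `Activity`, and the `u32` ids are hypothetical stand-ins; only the name `build_impl` and the "already inlined" early return come from this PR. Because the measurement sits inside the function and after the early-return check, every call site is covered and skipped calls are not counted.

```rust
use std::collections::HashSet;
use std::time::{Duration, Instant};

/// Hypothetical stand-in for a profiler record (not the rustc self-profiler API).
#[derive(Default)]
struct Activity {
    calls: u32,
    total: Duration,
}

/// Hypothetical stand-in for the rustdoc context that tracks inlined ids.
#[derive(Default)]
struct Context {
    inlined: HashSet<u32>,
    build_impl_activity: Activity,
}

/// Sketch of the "after" behaviour: measure inside the function, past the early return.
fn build_impl(cx: &mut Context, did: u32, ret: &mut Vec<String>) {
    // `HashSet::insert` returns false when the id was already present.
    if !cx.inlined.insert(did) {
        return; // already inlined: neither counted nor timed
    }
    let start = Instant::now();
    ret.push(format!("impl for {}", did)); // placeholder for the real work
    cx.build_impl_activity.calls += 1;
    cx.build_impl_activity.total += start.elapsed();
}

/// Calls `build_impl` five times with two repeated ids and reports what was recorded.
fn demo() -> (u32, usize, Duration) {
    let mut cx = Context::default();
    let mut ret = Vec::new();
    for did in vec![1, 2, 2, 3, 1] {
        build_impl(&mut cx, did, &mut ret);
    }
    (cx.build_impl_activity.calls, ret.len(), cx.build_impl_activity.total)
}
```

In this sketch the cost of the `insert` check itself falls outside the timed region, which is the trade-off discussed later in the thread.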

rust-highfive · 2021-10-28T03:14:03Z

r? @GuillaumeGomez

(rust-highfive has picked a reviewer for you, use r? to override)

camelid · 2021-10-28T03:14:15Z

@bors try @rust-timer queue

rust-timer · 2021-10-28T03:14:17Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-10-28T03:14:23Z

⌛ Trying commit 37e56fa8115203f422fe7fef84fa285a3a072e20 with merge c759bedb91d5b1f6fe5cd357d5b17e6141d6485f...

camelid · 2021-10-28T03:15:56Z

@bors try @rust-timer queue

rust-timer · 2021-10-28T03:15:57Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-10-28T03:16:03Z

⌛ Trying commit eb713d2 with merge e9fd48d4c67a3126ae9b2d3bc6ad81753457c689...

bors · 2021-10-28T04:34:24Z

☀️ Try build successful - checks-actions
Build commit: e9fd48d4c67a3126ae9b2d3bc6ad81753457c689 (e9fd48d4c67a3126ae9b2d3bc6ad81753457c689)

rust-timer · 2021-10-28T04:34:26Z

Queued e9fd48d4c67a3126ae9b2d3bc6ad81753457c689 with parent 4e0d397, future comparison URL.

rust-timer · 2021-10-28T06:05:50Z

Finished benchmarking commit (e9fd48d4c67a3126ae9b2d3bc6ad81753457c689): comparison url.

Summary: This benchmark run did not return any relevant changes.

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR led to changes in compiler perf.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf -perf-regression

jyn514 · 2021-10-28T09:50:26Z

Hmm, when I added this it was intentional to only measure `collect_trait_impls`, since that's the hottest loop, and `.insert` already has some overhead I think it's important to measure. What other call sites are there for `build_trait`?

camelid · 2021-10-28T22:02:12Z

> Hmm, when I added this it was intentional to only measure `collect_trait_impls`, since that's the hottest loop, and `.insert` already has some overhead I think it's important to measure. What other call sites are there for `build_trait`?

src/librustdoc/clean/utils.rs
194:                inline::build_impl(cx, None, did, None, ret);

src/librustdoc/clean/inline.rs
295:        build_impl(cx, parent_module, did, attrs, ret);

src/librustdoc/passes/collect_trait_impls.rs
35:                inline::build_impl(cx, None, did, None, &mut new_items);
44:                inline::build_impl(cx, None, def_id, None, &mut new_items);
117:                inline::build_impl(cx, None, impl_did, Some(&extra_attrs), &mut new_items);

camelid · 2021-10-28T22:05:22Z

> Hmm, when I added this it was intentional to only measure `collect_trait_impls`, since that's the hottest loop

What would the downside be of measuring all uses though?

> `.insert` already has some overhead I think it's important to measure

I would think the overhead of `.insert` is probably negligible though. Also, having those measurements is not really actionable, and IMO the improved accuracy of the execution count is more helpful than measuring the likely small amount of time `.insert` takes.

jyn514 · 2021-10-28T23:51:53Z

Sure, I mean, at the end of the day anything this detailed will probably want to use cachegrind instead. I'm fine with either landing this or not.

camelid · 2021-10-29T00:59:02Z

> Sure, I mean, at the end of the day anything this detailed will probably want to use cachegrind instead. I'm fine with either landing this or not.

The two reasons I'm proposing this change are (1) I want more accurate execution count numbers and (2) IMO it's more consistent to measure all calls of `build_impl`, not just a callsite that happens to be hot. I'm planning to at some point look into using cachegrind or a similar tool to investigate perf too :)

jyn514 · 2021-10-29T01:08:19Z

@bors r+ rollup=never

bors · 2021-10-29T01:08:20Z

📌 Commit eb713d2 has been approved by jyn514

bors · 2021-10-29T01:50:11Z

⌛ Testing commit eb713d2 with merge a9f664f...

bors · 2021-10-29T04:55:46Z

☀️ Test successful - checks-actions
Approved by: jyn514
Pushing a9f664f to master...

rust-timer · 2021-10-29T06:27:58Z

Finished benchmarking commit (a9f664f): comparison url.

Summary: This benchmark run did not return any relevant changes.

If you disagree with this performance assessment, please file an issue in rust-lang/rustc-perf.

@rustbot label: -perf-regression

camelid added the T-rustdoc Relevant to the rustdoc team, which will review and decide on the PR/issue. label Oct 28, 2021

rust-highfive assigned GuillaumeGomez Oct 28, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 28, 2021

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Oct 28, 2021

Improve perf measurements of build_extern_trait_impl

eb713d2

camelid force-pushed the build-impl-perf branch from 37e56fa to eb713d2 on October 28, 2021 03:15

rustbot removed the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Oct 28, 2021

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 29, 2021

bors added the merged-by-bors This PR was explicitly merged by bors. label Oct 29, 2021

bors merged commit a9f664f into rust-lang:master Oct 29, 2021

rustbot added this to the 1.58.0 milestone Oct 29, 2021

camelid deleted the build-impl-perf branch October 29, 2021 18:18
