[stdlib] Force-inline some Sequence/Collection customization points #19683

lorentey · 2018-10-03T12:22:01Z

This should be a code size win for specialized code at least, but I suspect there may be some performance improvements, too.

…tomization points This should eliminate a branch, which should probably lead to a tiny overall code size improvement, as well as a tiny performance boost.

…ence/Collection customization points This is usually a code size pessimization, but in this case the bodies are trivial, so inlining them eliminates a call + a conditional branch.

lorentey · 2018-10-03T12:22:12Z

@swift-ci smoke benchmark

lorentey · 2018-10-03T12:22:22Z

@swift-ci smoke test

swift-ci · 2018-10-03T12:48:31Z

Build comment file:

Performance: -O

TEST	OLD	NEW	DELTA	RATIO
Improvement
DictionaryKeysContainsNative	45	40	-11.1%	1.12x

Code size: -O

TEST	OLD	NEW	DELTA	RATIO
Improvement
DictionaryKeysContains.o	18963	17043	-10.1%	1.11x

Performance: -Osize

TEST	OLD	NEW	DELTA	RATIO
Regression
UTF8Decode_InitFromBytes_ascii	490	582	+18.8%	0.84x
Improvement
Array2D	13194	11998	-9.1%	1.10x

Code size: -Osize

TEST	OLD	NEW	DELTA	RATIO
Improvement
DictionaryKeysContains.o	16859	15003	-11.0%	1.12x
NibbleSort.o	20000	19616	-1.9%	1.02x

How to read the data

The tables contain differences in performance which are larger than 8% and differences in code size which are larger than 1%.

If you see any unexpected regressions, you should consider fixing the regressions before you merge the PR.

Noise: Sometimes the performance results (not code size!) contain false alarms. Unexpected regressions which are marked with '(?)' are probably noise. If you see regressions which you cannot explain you can try to run the benchmarks again. If regressions still show up, please consult with the performance team (@eeckstein).

Hardware Overview

  Model Name: Mac Pro
  Model Identifier: MacPro6,1
  Processor Name: 12-Core Intel Xeon E5
  Processor Speed: 2.7 GHz
  Number of Processors: 1
  Total Number of Cores: 12
  L2 Cache (per Core): 256 KB
  L3 Cache: 30 MB
  Memory: 64 GB

lorentey · 2018-10-03T14:42:12Z

Results are as expected. Would've loved to see movement beyond Dictionary.Keys, although I guess that is the primary benefactor right now.

lorentey added 2 commits October 3, 2018 13:14

[stdlib] Dictionary, Set: Force-inline hidden Sequence/Collection cus…

2f0f43a

…tomization points This should eliminate a branch, which should probably lead to a tiny overall code size improvement, as well as a tiny performance boost.

[stdlib] Force-inline trivial default implementations for hidden Sequ…

4615e18

…ence/Collection customization points This is usually a code size pessimization, but in this case the bodies are trivial, so inlining them eliminates a call + a conditional branch.

lorentey merged commit 20bb815 into swiftlang:master Oct 3, 2018

lorentey deleted the inline-customization-points branch October 3, 2018 14:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[stdlib] Force-inline some Sequence/Collection customization points #19683

[stdlib] Force-inline some Sequence/Collection customization points #19683

Uh oh!

lorentey commented Oct 3, 2018

Uh oh!

lorentey commented Oct 3, 2018

Uh oh!

lorentey commented Oct 3, 2018

Uh oh!

swift-ci commented Oct 3, 2018

Uh oh!

lorentey commented Oct 3, 2018 •

edited

Loading

Uh oh!

Uh oh!

[stdlib] Force-inline some Sequence/Collection customization points #19683

[stdlib] Force-inline some Sequence/Collection customization points #19683

Uh oh!

Conversation

lorentey commented Oct 3, 2018

Uh oh!

lorentey commented Oct 3, 2018

Uh oh!

lorentey commented Oct 3, 2018

Uh oh!

swift-ci commented Oct 3, 2018

Build comment file:

Performance: -O

Code size: -O

Performance: -Osize

Code size: -Osize

Uh oh!

lorentey commented Oct 3, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

lorentey commented Oct 3, 2018 •

edited

Loading