benchmark: add naive string finding #26744

weissi · 2019-08-20T17:32:01Z

FindStringNaive is a simple benchmark which implements a naive String
finding algorithm that currently shows a lot of ARC traffic, hopefully
to be reduced in the future.

this is a benchmark for https://bugs.swift.org/browse/SR-11335

weissi · 2019-08-21T14:32:52Z

@swift-ci please benchmark

swift-ci · 2019-08-21T14:52:57Z

Performance: -O

Regression	OLD	NEW	DELTA	RATIO
CharIndexing_punctuated_unicodeScalars_Backwards	720	800	+11.1%	0.90x (?)
FlattenListFlatMap	3876	4239	+9.4%	0.91x (?)
MapReduceSequence	397	428	+7.8%	0.93x (?)

Improvement	OLD	NEW	DELTA	RATIO
ObjectiveCBridgeStubNSDateRefAccess	196	174	-11.2%	1.13x (?)

Added	MIN	MAX	MEAN	MAX_RSS
FindStringNaive.ArrayUTF8	436	444	439	—
FindStringNaive.String	316	316	316	—
FindStringNaive.Substring	637	637	637	—

Code size: -O

Performance: -Osize

Regression	OLD	NEW	DELTA	RATIO
ObjectiveCBridgeStubDateAccess	130	152	+16.9%	0.86x
FlattenListLoop	2168	2379	+9.7%	0.91x (?)

Improvement	OLD	NEW	DELTA	RATIO
Dictionary4	205	164	-20.0%	1.25x
CharacterLiteralsLarge	74	67	-9.5%	1.10x
StringHasPrefixAscii	1280	1170	-8.6%	1.09x (?)

Added	MIN	MAX	MEAN	MAX_RSS
FindStringNaive.ArrayUTF8	437	444	439	—
FindStringNaive.String	315	315	315	—
FindStringNaive.Substring	637	649	641	—

Code size: -Osize

Performance: -Onone

Improvement	OLD	NEW	DELTA	RATIO
ObjectiveCBridgeStubFromNSStringRef	216	110	-49.1%	1.96x (?)
Breadcrumbs.MutatedUTF16ToIdx.ASCII	15	14	-6.7%	1.07x (?)

Added	MIN	MAX	MEAN	MAX_RSS
FindStringNaive.ArrayUTF8	188100	188277	188197	—
FindStringNaive.String	600	601	601	—
FindStringNaive.Substring	760	760	760	—

Code size: -swiftlibs

✅	Benchmark Check Report

How to read the data

The tables contain differences in performance which are larger than 8% and differences in code size which are larger than 1%.

If you see any unexpected regressions, you should consider fixing the
regressions before you merge the PR.

Noise: Sometimes the performance results (not code size!) contain false
alarms. Unexpected regressions which are marked with '(?)' are probably noise.
If you see regressions which you cannot explain you can try to run the
benchmarks again. If regressions still show up, please consult with the
performance team (@eeckstein).

Hardware Overview

  Model Name: Mac mini
  Model Identifier: Macmini8,1
  Processor Name: Intel Core i7
  Processor Speed: 3.2 GHz
  Number of Processors: 1
  Total Number of Cores: 6
  L2 Cache (per Core): 256 KB
  L3 Cache: 12 MB
  Memory: 64 GB

weissi · 2019-08-21T15:17:29Z

@gottesmm the benchmark is now happy and gives me a green tick. How does this look to you?

weissi · 2019-08-21T15:20:34Z

@milseman also turns out that finding a UInt8 in a [UInt8] is 100x(!!) faster than finding a Character in a String (despite it being UTF-8 with ascii only chars).

gottesmm · 2019-08-21T17:28:31Z

@weissi Just squash the commits then LGTM.

Catfish-Man · 2019-08-21T17:48:03Z

@milseman also turns out that finding a UInt8 in a [UInt8] is 100x(!!) faster than finding a Character in a String (despite it being UTF-8 with ascii only chars).

This is most likely because of having to treat CRLF as a single Character. It prevents a lot of the obvious fast paths.

weissi · 2019-08-21T18:18:09Z

@gottesmm done, thank you!

Catfish-Man · 2019-08-21T22:36:00Z

Please don't edit my comments (??)

weissi · 2019-08-22T09:57:00Z

Please don't edit my comments (??)

So sorry!! I was on my phone as must have clicked 'edit' instead of 'quote reply' and didn't notice :'(

benchmark/single-source/FindStringNaive.swift

weissi · 2019-08-28T16:11:49Z

@swift-ci please benchmark

weissi · 2019-08-28T16:53:03Z

@swift-ci please benchmark

swift-ci · 2019-08-28T17:11:09Z

Build failed before running benchmark.

weissi · 2019-08-28T17:32:13Z

@swift-ci please benchmark

swift-ci · 2019-08-28T18:15:42Z

Build failed before running benchmark.

FindStringNaive is a simple benchmark which implements a naive String finding algorithm that currently shows a lot of ARC traffic, hopefully to be reduced in the future.

weissi · 2019-08-28T18:57:38Z

@swift-ci please benchmark

swift-ci · 2019-08-28T19:24:54Z

Performance: -O

Improvement	OLD	NEW	DELTA	RATIO
CharacterLiteralsLarge	108	97	-10.2%	1.11x (?)
CharacterLiteralsSmall	340	317	-6.8%	1.07x (?)

Added	MIN	MAX	MEAN	MAX_RSS
FindString.Loop1.Array	571	572	571	—
FindString.Loop1.Substring	803	803	803	—
FindString.Rec3.Array	237	238	237	—
FindString.Rec3.String	206	208	207	—
FindString.Rec3.Substring	244	246	245	—

Code size: -O

Performance: -Osize

Regression	OLD	NEW	DELTA	RATIO
FlattenListLoop	5036	5644	+12.1%	0.89x (?)
ObjectiveCBridgeStubDateMutation	257	285	+10.9%	0.90x
Array2D	7520	8112	+7.9%	0.93x
RandomShuffleLCG2	816	880	+7.8%	0.93x (?)

Improvement	OLD	NEW	DELTA	RATIO
ObjectiveCBridgeStubFromNSDateRef	4200	3720	-11.4%	1.13x (?)
ObjectiveCBridgeStubFromNSStringRef	176	156	-11.4%	1.13x (?)
ObjectiveCBridgeStubDateAccess	257	228	-11.3%	1.13x
NSStringConversion.UTF8	834	770	-7.7%	1.08x (?)

Added	MIN	MAX	MEAN	MAX_RSS
FindString.Loop1.Array	571	572	571	—
FindString.Loop1.Substring	803	803	803	—
FindString.Rec3.Array	237	237	237	—
FindString.Rec3.String	211	213	212	—
FindString.Rec3.Substring	243	245	244	—

Code size: -Osize

Performance: -Onone

Added	MIN	MAX	MEAN	MAX_RSS
FindString.Loop1.Array	266727	295816	276477	—
FindString.Loop1.Substring	989	990	989	—
FindString.Rec3.Array	14355	14498	14415	—
FindString.Rec3.String	935	935	935	—
FindString.Rec3.Substring	897	898	898	—

Code size: -swiftlibs

✅	Benchmark Check Report

How to read the data

The tables contain differences in performance which are larger than 8% and differences in code size which are larger than 1%.

If you see any unexpected regressions, you should consider fixing the
regressions before you merge the PR.

Noise: Sometimes the performance results (not code size!) contain false
alarms. Unexpected regressions which are marked with '(?)' are probably noise.
If you see regressions which you cannot explain you can try to run the
benchmarks again. If regressions still show up, please consult with the
performance team (@eeckstein).

Hardware Overview

  Model Name: Mac Pro
  Model Identifier: MacPro6,1
  Processor Name: 12-Core Intel Xeon E5
  Processor Speed: 2.7 GHz
  Number of Processors: 1
  Total Number of Cores: 12
  L2 Cache (per Core): 256 KB
  L3 Cache: 30 MB
  Memory: 64 GB

milseman

LGTM

weissi · 2019-08-29T18:00:32Z

@swift-ci please test

swift-ci · 2019-08-29T18:02:45Z

Build failed
Swift Test Linux Platform
Git Sha - 43a151738c2abfad8548c7360b0cc97522e7c0ac

swift-ci · 2019-08-29T18:04:20Z

Build failed
Swift Test OS X Platform
Git Sha - 43a151738c2abfad8548c7360b0cc97522e7c0ac

weissi · 2019-08-29T19:41:31Z

@swift-ci smoke test and merge

palimondo · 2019-09-03T10:11:09Z

benchmark/single-source/FindStringNaive.swift

+@inline(never)
+func runBenchRecursive3ArrayOfUTF8(iterations: Int) {
+  for _ in 0 ..< iterations {
+    precondition(findThreeOnUTF8Collection(longArrayFoFoFoFox!,


@eeckstein @gottesmm Is the use of precondition here fully equivalent to traditional CheckResults we use in all other benchmarks?

Yeah, CheckResult is better. It should be slightly less overhead because it's inlinable.

weissi requested a review from gottesmm August 20, 2019 17:32

weissi force-pushed the jw-bench-find-string branch from 45acc7a to 7f5a834 Compare August 20, 2019 17:59

swiftlang deleted a comment from swift-ci Aug 20, 2019

weissi force-pushed the jw-bench-find-string branch from 6d8b95f to 7964013 Compare August 21, 2019 07:20