Vectorize UTF16 offset calculations #41866

Catfish-Man · 2022-03-17T21:19:12Z

Cleaned up PR and commit history for #41684

Catfish-Man · 2022-03-17T21:20:32Z

@swift-ci please smoke test

Catfish-Man · 2022-03-18T01:11:21Z

@swift-ci Please test Windows platform

glessard

Good stuff. Thanks.

milseman · 2022-03-18T18:49:28Z

@swift-ci please benchmark

stephentyrone · 2022-03-18T19:19:08Z

stdlib/public/core/StringUTF16View.swift

+      let fourBytes = U.zero.replacing(with: U.one, where: uValue .>= 0b11110000)
+      let fourByteCount = Int(fourBytes.wrappedSum())
+
+      utf16Count &+= (U.scalarCount - continuationCount) + fourByteCount


Horizontal operations (wrappedSum) are expensive; usually in SIMD code we keep the accumulation in vector until forced to map down to a scalar rather, than accumulating every loop iteration. It's a little bit of a pain with Int8 because you can only do 127 accumulations before you have to worry about overflow, but still doing one horizontal operation every N iterations is much better than 2/iteration.

The easy change here would be to do:

&+= U.scalarCount + (fourByteCount &- continuationCount).wrappedSum()

so that you only do half as many as currently.

Sure, I'll make a followup patch

(The optimizer might be managing to pull this apart for you already, but it would be nice to make explicit anyway.)

(I'm hoping once wider vectors are feasible that the common case here, post-breadcrumb offsets, will only ever iterate once because we can just do the entire 64 bytes at once)

Vectorize UTF16 offset calculations

eaf3f31

Catfish-Man requested review from milseman and stephentyrone March 17, 2022 21:19

Catfish-Man self-assigned this Mar 17, 2022

Catfish-Man mentioned this pull request Mar 17, 2022

Vectorize UTF16 offset calculations #41684

Closed

glessard approved these changes Mar 18, 2022

View reviewed changes

Catfish-Man merged commit cb082e1 into swiftlang:main Mar 18, 2022

stephentyrone reviewed Mar 18, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vectorize UTF16 offset calculations #41866

Vectorize UTF16 offset calculations #41866

Uh oh!

Catfish-Man commented Mar 17, 2022

Uh oh!

Catfish-Man commented Mar 17, 2022

Uh oh!

Catfish-Man commented Mar 18, 2022

Uh oh!

glessard left a comment

Uh oh!

milseman commented Mar 18, 2022

Uh oh!

stephentyrone Mar 18, 2022 •

edited

Loading

Uh oh!

Catfish-Man Mar 18, 2022

Uh oh!

stephentyrone Mar 18, 2022

Uh oh!

Catfish-Man Mar 18, 2022

Uh oh!

Catfish-Man Mar 18, 2022

Uh oh!

Uh oh!

Vectorize UTF16 offset calculations #41866

Vectorize UTF16 offset calculations #41866

Uh oh!

Conversation

Catfish-Man commented Mar 17, 2022

Uh oh!

Catfish-Man commented Mar 17, 2022

Uh oh!

Catfish-Man commented Mar 18, 2022

Uh oh!

glessard left a comment

Choose a reason for hiding this comment

Uh oh!

milseman commented Mar 18, 2022

Uh oh!

stephentyrone Mar 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Catfish-Man Mar 18, 2022

Choose a reason for hiding this comment

Uh oh!

stephentyrone Mar 18, 2022

Choose a reason for hiding this comment

Uh oh!

Catfish-Man Mar 18, 2022

Choose a reason for hiding this comment

Uh oh!

Catfish-Man Mar 18, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

stephentyrone Mar 18, 2022 •

edited

Loading