Dynamic type lookup string compare optimization #42390

ladd · 2022-04-15T17:09:48Z

This is a small optimization for a specific path of dynamic type lookup, as profiled below in our iOS application. In a small synthetic benchmark, using strncmp is roughly twice as fast as using strlen + memcmp via StringRef in the case where the two mangled type strings match.

(missing frames due to inlining?)

Other ideas for optimizing this path welcome!

stdlib/public/runtime/MetadataLookup.cpp

mikeash · 2022-04-20T13:00:50Z

@swift-ci please benchmark

ladd · 2022-04-20T19:21:00Z

Seems to be within the noise, if I'm understanding the report. Does this need a smoke test as well? (I don't have commit access)

mikeash · 2022-04-20T19:34:15Z

Yeah, looks like no measurable difference (no surprise there) but still seems worth doing. I'll go ahead and run a full test.

mikeash · 2022-04-20T19:34:19Z

@swift-ci please test

ladd · 2022-04-21T18:25:08Z

Thanks for running the tests. Anything else to do here?

jckarter · 2022-04-21T18:29:26Z

If this is a real win, it might be nice to upstream it to LLVM as an additional overload for StringRef::equals, so that all uses of StringRef::equals with a C string argument in the compiler and runtime benefit.

jckarter · 2022-04-21T18:34:47Z

stdlib/public/runtime/MetadataLookup.cpp

+  size_t length = s1.size();
+  // It may be possible for s1 to contain embedded NULL characters
+  // so additionally validate that the lengths match
+  return strncmp(s1.data(), s2, length) == 0 && strlen(s2) == length;


It's a bit unfortunate to have to iterate through s2 twice here. What does performance look like if we use an inline loop here instead of strncmp to do the entire comparison in one pass? It'd be interesting to see if strncmp and/or strlen are optimized enough to offset the double iteration through s2 compared to a loop that precisely accounted for s1's fixed length and s2's null termination in one pass.

A naive inline loop seems to be ≈3x slower than platform strncmp in my tests with a 65 byte long sample string. I suspect this is due to the optimized ARM assembly implementations in llvm-project/libc/AOR_v20.02/string/aarch64 + the compiler's optimization of built-ins. That said, my assembly skills are a bit rusty.

I tried something like this with -O3:

inline static bool testcmp(const char *left, const char *right, size_t n) { for (int i=0; i < n; i++) { char lc = left[i]; char rc = right[i]; if (lc != rc || rc == '\0') { return false; } } return true; }

In that case, it looks good as is. Thanks for trying it out!

No worries -- at this level of the code, I think you could only do better if you already knew the length of s2, or knew that s1 contained no embedded NUL characters.

mikeash · 2022-04-21T20:54:09Z

@jckarter Are you good with merging now? Getting it into LLVM sounds nice but as a separate thing.

jckarter · 2022-04-21T20:54:44Z

@mikeash Yeah, I'm fine merging this now.

ladd changed the title ~~String compare optimization~~ Dynamic type lookup string compare optimization Apr 15, 2022

String compare optimization

e24b433

ladd force-pushed the stringRefEqualsCString branch from 7d0915e to e24b433 Compare April 15, 2022 17:13

tbkka requested review from mikeash and al45tair April 15, 2022 17:25

mikeash reviewed Apr 15, 2022

View reviewed changes

stdlib/public/runtime/MetadataLookup.cpp Outdated Show resolved Hide resolved

Adopt suggestion from @mikeash

f4d0c3c

jckarter reviewed Apr 21, 2022

View reviewed changes

jckarter merged commit 2afb9b7 into swiftlang:main Apr 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dynamic type lookup string compare optimization #42390

Dynamic type lookup string compare optimization #42390

Uh oh!

ladd commented Apr 15, 2022 •

edited

Loading

Uh oh!

Uh oh!

mikeash commented Apr 20, 2022

Uh oh!

ladd commented Apr 20, 2022

Uh oh!

mikeash commented Apr 20, 2022

Uh oh!

mikeash commented Apr 20, 2022

Uh oh!

ladd commented Apr 21, 2022

Uh oh!

jckarter commented Apr 21, 2022

Uh oh!

jckarter Apr 21, 2022 •

edited

Loading

Uh oh!

ladd Apr 21, 2022 •

edited

Loading

Uh oh!

jckarter Apr 21, 2022

Uh oh!

ladd Apr 21, 2022

Uh oh!

mikeash commented Apr 21, 2022

Uh oh!

jckarter commented Apr 21, 2022

Uh oh!

Uh oh!

Dynamic type lookup string compare optimization #42390

Dynamic type lookup string compare optimization #42390

Uh oh!

Conversation

ladd commented Apr 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

mikeash commented Apr 20, 2022

Uh oh!

ladd commented Apr 20, 2022

Uh oh!

mikeash commented Apr 20, 2022

Uh oh!

mikeash commented Apr 20, 2022

Uh oh!

ladd commented Apr 21, 2022

Uh oh!

jckarter commented Apr 21, 2022

Uh oh!

jckarter Apr 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ladd Apr 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jckarter Apr 21, 2022

Choose a reason for hiding this comment

Uh oh!

ladd Apr 21, 2022

Choose a reason for hiding this comment

Uh oh!

mikeash commented Apr 21, 2022

Uh oh!

jckarter commented Apr 21, 2022

Uh oh!

Uh oh!

ladd commented Apr 15, 2022 •

edited

Loading

jckarter Apr 21, 2022 •

edited

Loading

ladd Apr 21, 2022 •

edited

Loading