[CodeComplete] Compute type relations for global cached results #41000

ahoppen · 2022-01-25T17:37:48Z

Computing the type relation for every item in the code completion cache is way to expensive (~4x slowdown for global completion that imports SwiftUI). Instead, compute a type’s supertypes (protocol conformances and superclasses) once and write their USRs to the cache. To compute a type relation we can then check if the contextual type is in the completion item’s supertypes.

This reduces the overhead of computing the type relations (again global completion that imports SwiftUI) to ~6% – measured by instructions executed.

Technically, we might miss some conversions like

retroactive conformances inside another module (because we can’t cache them if that other module isn’t imported)
complex generic conversions (just too complicated to model using USRs)

Because of this, we never report an unrelated type relation for global items but always default to unknown.

But I believe this change covers the most common cases and is a good tradeoff between accuracy and performance.

rdar://83846531

@rintaro I needed to undo the change you suggested in #40999 (comment):

When we were only storing AST-based types, we could have a boolean flag to also consider metatypes. This trick is harder to pull off with USR-based types, because we can’t just invoke the MetatypeType constructor with the instance type.

We could hack together a USR-based instance type to Metatype transformation by appending an "m" to the instance type’s USR but I decided to store the metatype as another possible result type of CodeCompletionResultType because:

Performing USR manipulations when computing type relations seems pretty fragile to me
We don’t have an option to store a metatype’s supertypes if we just append an "m" to the USR and thus can’t determine "Convertible" type relations for metatypes (to be fair we can’t do this today either but it should be possible by computing the metatype supertypes in USRBasedType::fromType
I think we can afford to save another word by making CodeCompletionResultType contain SmallVector<Type, 1> instead of Type.

ahoppen · 2022-01-25T17:37:59Z

@swift-ci Please smoke test macOS

ahoppen · 2022-02-17T11:38:44Z

@swift-ci Please smoke test

ahoppen · 2022-02-21T11:13:04Z

@swift-ci Please smoke test

rintaro

I'm still thinking how to avoid using vector for result type...

lib/IDE/CodeCompletion.cpp

rintaro · 2022-02-22T06:45:41Z

lib/IDE/CodeCompletionResultType.cpp

+      for (auto Proto : OpaqueType->getConformsTo()) {
+        USRConforms.push_back(
+            USRBasedType::fromType(Proto->getDeclaredInterfaceType(), Arena));
+      }


Handle Opaque->getSuperclass() too?

class MyClass {} protocol MyProto {} func fn<T: MyClass & MyProto>() -> T { return #^HERE^# }

Which result are you expecting here?

class MyClass {} protocol MyProto {} func fn<T: MyClass & MyProto>() -> T { return MyClass() // error: cannot convert return expression of type 'MyClass' to return type 'T' }

When adding MyClass: MyProto.

class MyClass: MyProto {} protocol MyProto {} func fn<T: MyClass & MyProto>() -> T { // warning: redundant conformance constraint 'T' : 'MyProto' return MyClass() // error: cannot convert return expression of type 'MyClass' to return type 'T' }

Hm, OK, we can't know the actual expected type if the context type is a generic parameter. But that applies to this as well, right?

class MyClass {} func fn<T: MyProto>() -> T { return #^HERE^# }

So should we limit this to OpaqueTypeArchetypeType?

Anyway, I believe we still need to handle getSuperclass() too.

class C {} class P {} class D: C, P {} class E: P {} func fn() -> some C & P { return #^HERE^# }

D is OK, but E is not.

Yeah, that makes sense. Implemented in 10d5d0d

rintaro · 2022-02-22T07:29:40Z

lib/IDE/CodeCompletionResultType.cpp

+}
+
+const USRBasedType *USRBasedType::fromType(Type Ty, USRBasedTypeArena &Arena) {
+  // FIXME: We can't mangle archetypes, so we treat them as null USRBasedTypes.


An example of archetype case is something like this?

func returnCollecton<T: MyProto>() -> T {}

If so, can we construct USRBasedType with empty USR and non-empty super types?

This actually works. We are reporting convertible type relations for types that conform to MyProto here. I added a test case for it. (dab4a2f)

I mean when suggesting it. The test case would be something like:

//-- Lib.swift public protocol MyProto {} public func returnSomeMyProto() -> some MyProto { ... } //-- test.swift import Lib func foo<T: MyProto>(_: T) func test() { foo(#^HERE^#) // 'returnSomeMyProto()' should be prioritized. }

I see, I decided to special case some with a single protocol for now as representing the conjunction of two types isn’t currently supported in USRBasedType and it’s not too simple to put it in. Since the entire USRBasedType thing is just an under-approximation anyway, I think it’s fine if we can’t represent some MyProto & MyOtherProto for now. 6aff8ef

include/swift/IDE/CodeCompletionResultType.h

lib/IDE/CodeCompletionResultType.cpp

include/swift/IDE/CodeCompletionResultType.h

ahoppen · 2022-02-22T14:24:59Z

I'm still thinking how to avoid using vector for result type…

I actually think the vector is a rather nice solution.

It solves the problem in a pretty elegant way that doesn’t require any USR-manipulations or anything of the sort
It doesn’t consume too much additional memory (just one size_t for the SmallVector<_, 1> in the common case that there is only one result type)
It’s extensible should we find other code completion items that can have multiple result type because there are multiple potential candidates where this might come in handy at some point. I can’t think of any concrete examples but I have a feeling that we might discover more completion results that have such a type duality.

ahoppen · 2022-02-22T14:25:13Z

@swift-ci Please smoke test

rintaro · 2022-02-22T19:47:14Z

include/swift/IDE/CodeCompletionResultType.h

+  /// This is not a single unique type because for code completion we consider
+  /// e.g. \c Int as producing both an \c Int metatype and an \c Int instance
+  /// type.
+  SmallVector<PointerUnion<Type, const USRBasedType *>, 1> ResultTypes;


I understand that using a "collection" might be a good solution.

However, again, CodeCompletionResultType resides in BumpPtrAllocator so we can't use SmallVector which requires the destructor call. ArrayRef might be OK, but how about using llvm::TrailingObject so we can pack the size() with IsNotApplicable flag, and save a pointer size of data(). USRBasedType could use TrailingObject too...

(I'm OK withArrayRef for now. TrailingObject can be followups )

Ah, now I finally understand what your concern about the collection was. As discussed offline, I decided to store the result types using two pointers since we don’t need a collection with more than 2 elements at the moment anyway 9ffc3a0

include/swift/IDE/CodeCompletion.h

rintaro · 2022-02-22T20:05:58Z

lib/IDE/CodeCompletionResultType.cpp

+      for (auto Proto : OpaqueType->getConformsTo()) {
+        USRConforms.push_back(
+            USRBasedType::fromType(Proto->getDeclaredInterfaceType(), Arena));
+      }


Hm, OK, we can't know the actual expected type if the context type is a generic parameter. But that applies to this as well, right?

class MyClass {} func fn<T: MyProto>() -> T { return #^HERE^# }

So should we limit this to OpaqueTypeArchetypeType?

Anyway, I believe we still need to handle getSuperclass() too.

class C {} class P {} class D: C, P {} class E: P {} func fn() -> some C & P { return #^HERE^# }

D is OK, but E is not.

rintaro · 2022-02-22T20:14:13Z

lib/IDE/CodeCompletionResultType.cpp

+}
+
+const USRBasedType *USRBasedType::fromType(Type Ty, USRBasedTypeArena &Arena) {
+  // FIXME: We can't mangle archetypes, so we treat them as null USRBasedTypes.


I mean when suggesting it. The test case would be something like:

//-- Lib.swift public protocol MyProto {} public func returnSomeMyProto() -> some MyProto { ... } //-- test.swift import Lib func foo<T: MyProto>(_: T) func test() { foo(#^HERE^#) // 'returnSomeMyProto()' should be prioritized. }

rintaro · 2022-02-22T20:32:09Z

lib/IDE/CodeCompletionResultType.cpp

+  if (auto Nominal = Ty->getAnyNominal()) {
+    // Sorted conformances so we get a deterministic supertype order and can
+    // assert that USRBasedTypes with the same USR have the same supertypes.
+    auto Conformances = Nominal->getAllConformances(/*sorted=*/true);


protocol P1 {} protocol P2: P1 {} class B: P2 {} protocol Q1 {} protocol Q2: Q1 {} class C: B, Q2 {}

In this case, USRBasedType::forType(C) would be like

p1 = {"P1"} p2 = {"P2", &p1} b = {"B", &p1, &p2} q1 = {"Q1"} q2 = {"Q2", &q1} c = {"C", &p1, &p2, &q1, &q2, &b}

right? This looks like a waste of the space. Could you limit Supertypes only contains the "direct" super types? So the structure be like:

p1 = {"P1"} p2 = {"P2", &p1} b = {"B", &p2} q1 = {"Q1"} q2 = {"Q2", &q1} c = {"C", &q2, &b}

When calculating the type relation, we should be able to collect all super type USRs by recursively look into the graph. Of course we need to be careful not to visit the same node twice, but it shouldn't be hard.

Good idea. It might slow down the computation of USRBasedTypes a little but since that should only happen once to build up the cache, I think it’s a good tradeoff daf8fb0

ahoppen · 2022-02-25T17:12:15Z

@swift-ci Please smoke test

ahoppen · 2022-02-25T17:48:57Z

@swift-ci Please smoke test

ahoppen · 2022-02-25T17:53:47Z

@swift-ci Please smoke test

rintaro

Added some comment, but nothing critical. We can do them in followups

rintaro · 2022-02-25T19:02:49Z

lib/IDE/CodeCompletionResultType.cpp

+
+  // For opaque types like 'some View', consider them equivalent to 'View'.
+  if (auto OpaqueType = Ty->getAs<ArchetypeType>()) {
+    if (auto Existential = OpaqueType->getExistentialType()) {


Does this support protocol with associated types? Adding a test case would be great.

It does, but while playing around with them I discovered another bug

public protocol ProtoWithAssocType { associatedtype MyAssoc } public func makeProtoWithAssocType() -> some ProtoWithAssocType { return StructWithAssocType() } func protoWithAssocTypeInGenericContext<T: ProtoWithAssocType>() -> T { return #^COMPLETE^# }

would suggest makeProtoWithAssocType as “Identical” even though it is not valid here. The problem was that we were doing some of the special opaque return type handling for USRBasedTypes for all archetypes. Fix was to only apply them to OpaqueTypeArchetypeType. This regressed test cases which are using a generic context, but I think that’s fine since the main known limitation of the USR-based type comparisons is that they don’t support generic contexts. 5a1dd49

rintaro · 2022-02-25T19:19:20Z

lib/IDE/CodeCompletionResultType.cpp

+    return TypeRelation::Identical;
+  }
+  for (auto *Supertype : ResultType->getSupertypes()) {
+    if (this->typeRelation(Supertype, VoidType) >= TypeRelation::Convertible) {


When the hierarchy contains diamond conformance,

protocol Base {} protocol P1: Base {} protocol P2: Base {} struct S: P1, P2 {}

I think Base is visited twice. I'm not sure this happens lot, and not sure it causes excessive waste. But this logic should have "visited" check.

Added it b12d82a

rintaro · 2022-02-25T19:26:00Z

include/swift/IDE/CodeCompletionResultType.h

+                           PointerUnion<Type, const USRBasedType *> ResultType1,
+                           PointerUnion<Type, const USRBasedType *> ResultType2)
+      : ResultType1AndIsApplicable(ResultType1, IsApplicable),
+        ResultType2(ResultType2) {}


Insert assert(ResultType1.is<Type>() == ResultType2.is<Type>()); or do we have a scenario mixing them?

We don’t have a scenario where we’re mixing them but we never make assumptions that both of them are of the same kind either. Adding the assertion just adds an artificial restriction that’s not necessary IMO.

ahoppen · 2022-02-28T10:36:23Z

@swift-ci Please smoke test

ahoppen · 2022-03-01T07:31:30Z

@swift-ci Please SourceKit stress test

Computing the type relation for every item in the code completion cache is way to expensive (~4x slowdown for global completion that imports `SwiftUI`). Instead, compute a type’s supertypes (protocol conformances and superclasses) once and write their USRs to the cache. To compute a type relation we can then check if the contextual type is in the completion item’s supertypes. This reduces the overhead of computing the type relations (again global completion that imports `SwiftUI`) to ~6% – measured by instructions executed. Technically, we might miss some conversions like - retroactive conformances inside another module (because we can’t cache them if that other module isn’t imported) - complex generic conversions (just too complicated to model using USRs) Because of this, we never report an `unrelated` type relation for global items but always default to `unknown`. But I believe this change covers the most common cases and is a good tradeoff between accuracy and performance. rdar://83846531

ahoppen · 2022-03-01T08:16:57Z

@swift-ci Please smoke test

ahoppen · 2022-03-01T09:10:32Z

@swift-ci Please SourceKit stress test

Sendable conformances are lazily synthesized as they are needed by the compiler. Depending on whether we checked whether a type conforms to Sendable before constructing the USRBasedType, we get different results for its conformance, causing assertion failures. For now, always ignore the Sendable conformance. We need to come up with a better solution for this problem in the future.

…based types Previously, for every type, we were reconstructing its entire supertype hierarchy as USRBasedTypes before checking the already computed USRBasedTypes. This lead to unneccary duplicated work.

ahoppen · 2022-03-02T09:02:24Z

@swift-ci Please SourceKit stress test

ahoppen · 2022-03-02T09:03:23Z

@swift-ci Please smoke test

hamishknight · 2022-03-02T21:46:15Z

https://ci.swift.org/job/oss-swift-pr-test-macoss/123 started failing after this change, reverting in #41627

ahoppen force-pushed the pr/compute-global-type-relations branch from afb517a to ff4114c Compare February 17, 2022 11:36

ahoppen marked this pull request as ready for review February 17, 2022 11:38

ahoppen requested a review from rintaro February 17, 2022 11:38

ahoppen force-pushed the pr/compute-global-type-relations branch from ff4114c to 26b045e Compare February 21, 2022 11:12

rintaro reviewed Feb 22, 2022

View reviewed changes

ahoppen force-pushed the pr/compute-global-type-relations branch from ebdd76f to 9ffc3a0 Compare February 25, 2022 17:05

ahoppen force-pushed the pr/compute-global-type-relations branch from 2d66a9c to d5a8435 Compare February 25, 2022 17:53

rintaro approved these changes Feb 25, 2022

View reviewed changes

ahoppen force-pushed the pr/compute-global-type-relations branch from d5a8435 to b12d82a Compare February 28, 2022 10:31

rintaro approved these changes Feb 28, 2022

View reviewed changes

ahoppen force-pushed the pr/compute-global-type-relations branch from b12d82a to 640cfac Compare March 1, 2022 08:16

ahoppen added 2 commits March 1, 2022 20:12

[CodeCompletion] Check USRBasedTypeCache earlier when converting AST-…

2d87dd3

…based types Previously, for every type, we were reconstructing its entire supertype hierarchy as USRBasedTypes before checking the already computed USRBasedTypes. This lead to unneccary duplicated work.

rintaro approved these changes Mar 2, 2022

View reviewed changes

ahoppen merged commit 910c2a4 into swiftlang:main Mar 2, 2022

ahoppen deleted the pr/compute-global-type-relations branch March 2, 2022 18:02

hamishknight mentioned this pull request Mar 2, 2022

Revert "[CodeComplete] Compute type relations for global cached results" #41627

Merged

ahoppen mentioned this pull request Mar 2, 2022

[CodeComplete] Compute type relations for global cached results (again) #41632

Merged

[CodeComplete] Compute type relations for global cached results #41000

[CodeComplete] Compute type relations for global cached results #41000

Uh oh!

Conversation

ahoppen commented Jan 25, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ahoppen commented Jan 25, 2022

Uh oh!

ahoppen commented Feb 17, 2022

Uh oh!

ahoppen commented Feb 21, 2022

Uh oh!

rintaro left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rintaro Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ahoppen Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rintaro Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ahoppen commented Feb 22, 2022

Uh oh!

ahoppen commented Feb 22, 2022

Uh oh!

rintaro Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rintaro Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rintaro Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rintaro Feb 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ahoppen commented Feb 25, 2022

Uh oh!

ahoppen commented Feb 25, 2022

Uh oh!

ahoppen commented Feb 25, 2022

Uh oh!

rintaro left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ahoppen commented Jan 25, 2022 •

edited

Loading

rintaro Feb 22, 2022 •

edited

Loading

ahoppen Feb 22, 2022 •

edited

Loading

rintaro Feb 22, 2022 •

edited

Loading

rintaro Feb 22, 2022 •

edited

Loading

rintaro Feb 22, 2022 •

edited

Loading

rintaro Feb 22, 2022 •

edited

Loading

rintaro Feb 22, 2022 •

edited

Loading

rintaro Feb 25, 2022 •

edited

Loading