EscapeAnalysis cleanup and add utilities [nearly NFC] #28103

atrick · 2019-11-06T04:45:08Z

This is the first in a series of patches that reworks
EscapeAnalysis. For this patch, I extracted every change that does not
introduce new features, rewrite logic, or appear to change
functionality.

These cleanups were done in preparation for:

- adding a graph representation for reference counted objects

- rewriting parts to the query logic

- ...which then allows the analysis to safely assume that all
  exclusive arguments are unique

- ...which then allows more aggressive optimization of local variables
  that don't escape

There are two possible functional changes:

getUnderlyingAddressRoot in InstructionUtils now sees through OSSA
instructions: begin_borrow and copy_value
The getPointerBase helper in EscapeAnalysis now sees through all of
these reference and pointer casts:
- case ValueKind::UncheckedRefCastInst:
- case ValueKind::ConvertFunctionInst:
- case ValueKind::UpcastInst:
- case ValueKind::InitExistentialRefInst:
- case ValueKind::OpenExistentialRefInst:
- case ValueKind::RawPointerToRefInst:
- case ValueKind::RefToRawPointerInst:
- case ValueKind::RefToBridgeObjectInst:
- case ValueKind::BridgeObjectToRefInst:
- case ValueKind::UncheckedAddrCastInst:
- case ValueKind::UnconditionalCheckedCastInst:
- case ValueKind::RefTo##Name##Inst:
- case ValueKind::Name##ToRefInst:
This coalesces a whole bunch of nodes together that were just there
because of casts. The existing code was already doing this for one
level of casts, but there was a bug that prevented it from happening
transitively. So, in theory, anything that breaks with this fix could
also break without this fix, but may not have been exposed. The fact
that this analysis coalesces address-to-reference casts at all is what
caused me to spent vast amounts of time debugging any time I tried to
force some structure on the graph via assertions. If it is done at
all, it should be done everywhere consistently to expose issues as
early as possible.

Here is a description of the changes in diff order. If something in
the diff is not listed here, then the code probably just moved around
in the file:

Rename isNotAliasedIndirectParameter to
isExclusiveIndirectParameter. The argument may be aliased in the
caller's scope and it's contents may have already escaped.

Add comments to SILType APIs (isTrivial, isReferenceCounted) that give
answers about the AST type which don't really make sense for address
SILTypes.

Add comments about CGNode's 'Value' field. I spent lots of time
attempting to tighten this down with asserts, but it's only possible
for non-content nodes. For content nodes, the node's value is highly
unpredictable and basically nonsense but needed for debugging.

Add comments about not assuming that the content nodes pointsTo edge
represents physical indirection. This matters when reasoning about
aliasing and it's a tempting assumption to make.

Add a CGNode::mergeProperties placeholder for adding node properties.

Factor out a CGNode::canAddDeferred helper for use later.

Rename setPointsTo to setPointsToEdge because it actually creates
an edge rather than just setting pointsTo.

Add CGNode::getValue() and related helpers to help maintain invariants.

Factor out a markEscaping helper.

Clean up the escapesInsideFunction helper.

Add node visitor helpers: visitSuccessors, visitDefers. This made is
much easier to prototype utilities.

Add comments to clarify the pointsTo invariant. In particular, an
entire defer web may have a null pointsTo for a while.

Add an activeWorklist to avoid nasty bugs when composing multiple
helpers that each use the worklist.

Remove the EA argument from getNode. I ended up needing access to
the EA context from the ConnectionGraph many times during
prototyping and passing this was all the getNode calls was very
silly.

Add graph visitor helpers: backwardTraverse, forwardTraverseDefer,
forwardTraversePointsToEdges, and mayReach for ease in developing new
logic and utilities.

Add isExclusiveArgument helper and distinguish exclusive arguments
from local objects. Confusing these properties could lead to scary
bugs. For example, unlike a local object, an exclusive argument's
contents may still escape even when the content's connection graph
node is non-escaping!

Add isUniquelyIdentified helper when we want to treat exclusive
arguments and local objects uniformly.

getUnderlyingAddressRoot now looks through OSSA instructions.

Rename getAccessedMemory to getDirectlyAccessedMemory with
comments. This is another dangerous API because it assumes the memory
access to a given address won't touch memory represented by different
graph nodes, but graph edges don't necessarily represent physical
indirection. Further clarify this issue in comments in
AliasAnalysis.cpp.

Factor out a 'findRecursiveRefType' helper from the old
'mayContainReference' for checking whether types may or must contain
references. Support both kinds of queries so the analysis can be
certain when a pointer's content is a physical heap object.

Factor out 'getPointerBase' and 'getPointerRoot' helpers that follow
address projections within what EscapeAnalysis will consider a single
node.

Create a CGNodeWorklist abstraction to safely formalize the worklist
mechanism that's used all over the place. In one place, there were
even two separate independent lists used independently (nodes added to
one list could appear to be in the other list).

The CGNodeMap abstraction did not significantly change, it was just moved.

Added 'dumpCG' for dumping .dot files making it possible to remote debug.

Added '-escapes-enable-graphwriter' option to dump .dot files, since
they are so much more useful than the textual dump of the connection
graph, which lacks node identity!

atrick · 2019-11-06T04:47:34Z

@eeckstein if you skim the diff side-by-side with the commit message, it should be quick to review. The commit message walks you through the diff. But if anything gets confusing just stop and ask me.

These are mostly the things that tripped me up when trying to understand the code or debugging my attempts to change the code.

atrick · 2019-11-06T04:47:40Z

@swift-ci test

atrick · 2019-11-06T04:47:47Z

@swift-ci benchmark

atrick · 2019-11-06T04:47:56Z

@swift-ci test source compatibility

atrick · 2019-11-06T05:40:14Z

@eeckstein Actually, there is one important functional change that fell out of
the cleanup. (If all tests pass, I'l fix the commit message,
otherwise, I'll remove the change, and debug it separately).

The functional change is that getPointerBase now sees through all of
these reference and pointer casts:

case ValueKind::UncheckedRefCastInst:
case ValueKind::ConvertFunctionInst:
case ValueKind::UpcastInst:
case ValueKind::InitExistentialRefInst:
case ValueKind::OpenExistentialRefInst:
case ValueKind::RawPointerToRefInst:
case ValueKind::RefToRawPointerInst:
case ValueKind::RefToBridgeObjectInst:
case ValueKind::BridgeObjectToRefInst:
case ValueKind::UncheckedAddrCastInst:
case ValueKind::UnconditionalCheckedCastInst:
case ValueKind::RefTo##Name##Inst:
case ValueKind::Name##ToRefInst:

This coalesces a whole bunch of nodes together that were just there
because of casts. The existing code was already doing this for one
level of casts, but there was a bug that prevented it from happening
transitively. So, in theory, anything that breaks with this fix could
also break without this fix, but may not have been exposed. The fact
that this analysis coalesces address-to-reference casts at all is what
caused me to spent vast amounts of time debugging any time I tried to
force some structure on the graph via assertions. If it is done at
all, it should be done everywhere consistently to expose issues as
early as possible.

swift-ci · 2019-11-06T05:40:54Z

Performance: -O

Regression	OLD	NEW	DELTA	RATIO
UTF8Decode_InitFromData_ascii_as_ascii	642	720	+12.1%	0.89x (?)
FloatingPointPrinting_Float_description_small	5616	6264	+11.5%	0.90x
StringComparison_fastPrenormal	1000	1080	+8.0%	0.93x

Improvement	OLD	NEW	DELTA	RATIO
LazilyFilteredArrayContains	35400	30100	-15.0%	1.18x
FlattenListLoop	4453	4098	-8.0%	1.09x (?)
RemoveWhereSwapInts	65	60	-7.7%	1.08x
DataSetCountSmall	148	137	-7.4%	1.08x (?)
MapReduceAnyCollection	398	369	-7.3%	1.08x (?)
Set.isDisjoint.Seq.Box.Empty	145	135	-6.9%	1.07x (?)
Set.isDisjoint.Seq.Int.Empty	90	84	-6.7%	1.07x
Set.isSuperset.Seq.Empty.Int	91	85	-6.6%	1.07x (?)
Set.isSubset.Int.Empty	91	85	-6.6%	1.07x (?)
Set.isStrictSubset.Int.Empty	91	85	-6.6%	1.07x (?)
ArrayLiteral2	137	128	-6.6%	1.07x (?)

Code size: -O

Performance: -Osize

Regression	OLD	NEW	DELTA	RATIO
FloatingPointPrinting_Float_description_small	5616	6264	+11.5%	0.90x

Improvement	OLD	NEW	DELTA	RATIO
RemoveWhereMoveInts	37	34	-8.1%	1.09x (?)
Array2D	7520	6928	-7.9%	1.09x (?)
RemoveWhereSwapInts	67	62	-7.5%	1.08x
DataSetCountSmall	151	140	-7.3%	1.08x (?)
FlattenListLoop	5313	4944	-6.9%	1.07x (?)
MapReduceAnyCollection	434	405	-6.7%	1.07x (?)
Set.isDisjoint.Seq.Int.Empty	90	84	-6.7%	1.07x (?)
Set.isSuperset.Seq.Empty.Int	90	84	-6.7%	1.07x (?)
Set.isStrictSubset.Int.Empty	91	85	-6.6%	1.07x (?)

Code size: -Osize

Performance: -Onone

Regression	OLD	NEW	DELTA	RATIO
ObjectiveCBridgeStubFromNSDateRef	4370	4850	+11.0%	0.90x (?)
NormalizedIterator_ascii	461	501	+8.7%	0.92x (?)

Improvement	OLD	NEW	DELTA	RATIO
ArrayOfGenericPOD2	1434	1324	-7.7%	1.08x (?)
ArrayOfPOD	1123	1038	-7.6%	1.08x (?)

Code size: -swiftlibs

How to read the data

The tables contain differences in performance which are larger than 8% and differences in code size which are larger than 1%.

If you see any unexpected regressions, you should consider fixing the
regressions before you merge the PR.

Noise: Sometimes the performance results (not code size!) contain false
alarms. Unexpected regressions which are marked with '(?)' are probably noise.
If you see regressions which you cannot explain you can try to run the
benchmarks again. If regressions still show up, please consult with the
performance team (@eeckstein).

Hardware Overview

  Model Name: Mac Pro
  Model Identifier: MacPro6,1
  Processor Name: 12-Core Intel Xeon E5
  Processor Speed: 2.7 GHz
  Number of Processors: 1
  Total Number of Cores: 12
  L2 Cache (per Core): 256 KB
  L3 Cache: 30 MB
  Memory: 64 GB

atrick · 2019-11-06T17:18:33Z

@swift-ci smoke test

This is the first in a series of patches that reworks EscapeAnalysis. For this patch, I extracted every change that does not introduce new features, rewrite logic, or appear to change functionality. These cleanups were done in preparation for: - adding a graph representation for reference counted objects - rewriting parts to the query logic - ...which then allows the analysis to safely assume that all exclusive arguments are unique - ...which then allows more aggressive optimization of local variables that don't escape There are two possible functional changes: 1. getUnderlyingAddressRoot in InstructionUtils now sees through OSSA instructions: begin_borrow and copy_value 2. The getPointerBase helper in EscapeAnalysis now sees through all of these reference and pointer casts: + case ValueKind::UncheckedRefCastInst: + case ValueKind::ConvertFunctionInst: + case ValueKind::UpcastInst: + case ValueKind::InitExistentialRefInst: + case ValueKind::OpenExistentialRefInst: + case ValueKind::RawPointerToRefInst: + case ValueKind::RefToRawPointerInst: + case ValueKind::RefToBridgeObjectInst: + case ValueKind::BridgeObjectToRefInst: + case ValueKind::UncheckedAddrCastInst: + case ValueKind::UnconditionalCheckedCastInst: + case ValueKind::RefTo##Name##Inst: + case ValueKind::Name##ToRefInst: This coalesces a whole bunch of nodes together that were just there because of casts. The existing code was already doing this for one level of casts, but there was a bug that prevented it from happening transitively. So, in theory, anything that breaks with this fix could also break without this fix, but may not have been exposed. The fact that this analysis coalesces address-to-reference casts at all is what caused me to spent vast amounts of time debugging any time I tried to force some structure on the graph via assertions. If it is done at all, it should be done everywhere consistently to expose issues as early as possible. Here is a description of the changes in diff order. If something in the diff is not listed here, then the code probably just moved around in the file: Rename isNotAliasedIndirectParameter to isExclusiveIndirectParameter. The argument may be aliased in the caller's scope and it's contents may have already escaped. Add comments to SILType APIs (isTrivial, isReferenceCounted) that give answers about the AST type which don't really make sense for address SILTypes. Add comments about CGNode's 'Value' field. I spent lots of time attempting to tighten this down with asserts, but it's only possible for non-content nodes. For content nodes, the node's value is highly unpredictable and basically nonsense but needed for debugging. Add comments about not assuming that the content nodes pointsTo edge represents physical indirection. This matters when reasoning about aliasing and it's a tempting assumption to make. Add a CGNode::mergeProperties placeholder for adding node properties. Factor out a CGNode::canAddDeferred helper for use later. Rename `setPointsTo` to `setPointsToEdge` because it actually creates an edge rather than just setting `pointsTo`. Add CGNode::getValue() and related helpers to help maintain invariants. Factor out a `markEscaping` helper. Clean up the `escapesInsideFunction` helper. Add node visitor helpers: visitSuccessors, visitDefers. This made is much easier to prototype utilities. Add comments to clarify the `pointsTo` invariant. In particular, an entire defer web may have a null pointsTo for a while. Add an `activeWorklist` to avoid nasty bugs when composing multiple helpers that each use the worklist. Remove the `EA` argument from `getNode`. I ended up needing access to the `EA` context from the ConnectionGraph many times during prototyping and passing `this` was all the `getNode` calls was very silly. Add graph visitor helpers: backwardTraverse, forwardTraverseDefer, forwardTraversePointsToEdges, and mayReach for ease in developing new logic and utilities. Add isExclusiveArgument helper and distinguish exclusive arguments from local objects. Confusing these properties could lead to scary bugs. For example, unlike a local object, an exclusive argument's contents may still escape even when the content's connection graph node is non-escaping! Add isUniquelyIdentified helper when we want to treat exclusive arguments and local objects uniformly. getUnderlyingAddressRoot now looks through OSSA instructions. Rename `getAccessedMemory` to `getDirectlyAccessedMemory` with comments. This is another dangerous API because it assumes the memory access to a given address won't touch memory represented by different graph nodes, but graph edges don't necessarily represent physical indirection. Further clarify this issue in comments in AliasAnalysis.cpp. Factor out a 'findRecursiveRefType' helper from the old 'mayContainReference' for checking whether types may or must contain references. Support both kinds of queries so the analysis can be certain when a pointer's content is a physical heap object. Factor out 'getPointerBase' and 'getPointerRoot' helpers that follow address projections within what EscapeAnalysis will consider a single node. Create a CGNodeWorklist abstraction to safely formalize the worklist mechanism that's used all over the place. In one place, there were even two separate independent lists used independently (nodes added to one list could appear to be in the other list). The CGNodeMap abstraction did not significantly change, it was just moved. Added 'dumpCG' for dumping .dot files making it possible to remote debug. Added '-escapes-enable-graphwriter' option to dump .dot files, since they are so much more useful than the textual dump of the connection graph, which lacks node identity!

atrick · 2019-11-06T19:10:14Z

@swift-ci smoke test

eeckstein

In general LGTM.
See my comments inline

lib/SILOptimizer/Analysis/EscapeAnalysis.cpp

eeckstein · 2019-11-06T19:24:01Z

We should not coalesce address-to-reference casts in favour of keeping this nice invariants in the graph to distinguish between "object" and "address" nodes (as we talked some time ago).
(Something for a follow-up PR).

atrick · 2019-11-06T21:34:03Z

@eeckstein I can't really say whether we lose important information in practice when we treat "interesting" casts as unknown pointers. It looked like PointerToAddress and AddressToPointer were important, and I felt that if we handle those we should continue to handle the rest. I'm afraid that if we stop recognizing local ArrayBuffers, we might miss some optimization. Who knows where else it might be important in the future.

I should have tried to eliminate cast handling first before adding information about heap objects to avoid time debugging issues where they kept breaking my verification. But now I have an implementation that supports the casts. The implementation isn't any more complicated, I just wasn't able to add as much self-checking. I want to make sure the changes I already have that may make some queries more conservative don't regress performance before adding other potential regressions on top of it.

atrick · 2019-11-06T21:38:16Z

@swift-ci smoke test

AnalyzeInstruction no longer needs to handle all these casts. getPointerBase does it now.

atrick · 2019-11-06T23:16:04Z

@swift-ci smoke test

eeckstein · 2019-11-06T23:42:11Z

@atrick We can still handle most of the casts, like pointer_to_address and address_to_pointer. Only ref_to_raw_pointer/raw_pointer_to_ref is problematic.
raw_pointer_to_ref is only used for the hack wit empty array/dictionary storage. But here we don't care, because those empty storage objects "escape" anyway.
ref_to_raw_pointer is used for array capacity computation and passed to a runtime function anyway. So also in this case it "escapes".

atrick · 2019-11-07T00:43:51Z

@eeckstein I have a few other EscapeAnalysis PRs to get through, then I can try removing support for merging address and ref type nodes. It's only useful if I can actually assert that we never mix address and ref types, but then I'm worried that assert will trigger in some rare situation we don't have tests for. It's hard for me to prove to myself that all possibilities are covered. For example:

bridge object casts: I guess we don't need to worry about this because the only non-address SILTypes that EA considers an "address" is RawPointer, and it never looks through any RawPointer casts.
unchecked_addr_cast: the content of one side of the cast could be RawPointer and the content of the other side could be a reference... so what happens when those content nodes are merged?

atrick requested a review from eeckstein November 6, 2019 04:45

eeckstein reviewed Nov 6, 2019

View reviewed changes

lib/SILOptimizer/Analysis/EscapeAnalysis.cpp Outdated Show resolved Hide resolved

eeckstein reviewed Nov 6, 2019

View reviewed changes

lib/SILOptimizer/Analysis/EscapeAnalysis.cpp Outdated Show resolved Hide resolved

atrick added 2 commits November 6, 2019 15:15

EscapeAnalysis: remove unused cast handling from the main loop.

cb1e787

AnalyzeInstruction no longer needs to handle all these casts. getPointerBase does it now.

EscapeAnalysis: remove unused pointsToEnd utility, for now.

13b7f45

atrick merged commit 9931aab into swiftlang:master Nov 7, 2019

atrick deleted the cleanup-escapes branch December 23, 2019 03:15

EscapeAnalysis cleanup and add utilities [nearly NFC] #28103

EscapeAnalysis cleanup and add utilities [nearly NFC] #28103

Uh oh!

Conversation

atrick commented Nov 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

atrick commented Nov 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

swift-ci commented Nov 6, 2019

Performance: -O

Code size: -O

Performance: -Osize

Code size: -Osize

Performance: -Onone

Code size: -swiftlibs

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

eeckstein left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

eeckstein commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

atrick commented Nov 6, 2019

Uh oh!

eeckstein commented Nov 6, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

atrick commented Nov 7, 2019

Uh oh!

Uh oh!

atrick commented Nov 6, 2019 •

edited

Loading

atrick commented Nov 6, 2019 •

edited

Loading

eeckstein commented Nov 6, 2019 •

edited

Loading