[Const evaluator] Add support to "skip" instructions in step-wise evaluation #24113

ravikandhadai · 2019-04-18T02:05:29Z

No description provided.

ravikandhadai · 2019-04-18T02:23:44Z

@marcrasi @devincoughlin This commit adds a new functionality to the stepwise constant evaluator which is "skipping" instructions without evaluating them while conservatively accounting for the effects of the skipped instructions on the interpreter state.

This enables a client to step through a sequence of instructions (possibly containing function calls) and decide to evaluate some and skip others, with a guarantee that the constant values found by the interpreter are sound. (That is, whenever constant values are inferred for variables during interpretation at a program point, those values match the runtime values of the variables if/whenever control reaches that program point.)

This functionality is only available at outer most level of interpretation, i.e, when a function call is evaluated it will be evaluated normally using flow-sensitive evaluation and does require all instructions in the function body to be interpretable and have constant values.

@marcrasi While this is a functionality aimed at optimization of the new os log APIs, I think this will also enable replacing/simplifying the existing top-level, backward evaluation mode of the interpreter used for extracting the arguments to #asserts as constants (e.g. functions like getSingleWriterAddressValue etc.), and have a more unified approach that uses only flow-sensitive evaluation mode. Furthermore, I think this will also enable eliminating the mutual recursion in the interpreter, which I believe mainly exists for accomplishing this backward evaluation.

ravikandhadai · 2019-04-18T02:25:26Z

@swift-ci Please smoke test

ravikandhadai · 2019-04-18T02:25:51Z

Btw, see only the latest commit in this PR, which contains the relevant changes.

ravikandhadai · 2019-04-18T20:56:23Z

@swift-ci Please smoke test

ravikandhadai · 2019-04-18T23:37:34Z

@swift-ci Please smoke test Linux Platform

marcrasi · 2019-04-19T00:02:48Z

The implementation makes sense to me, but I don't completely understand the bigger picture and the purpose of this change.

Specifically, something seems missing: Any client using the "skip" functionality will have to do a first pass through the instructions to determine which ones it wants to skip. For example, rewriting top-level evaluation in terms of this new functionality would look something like this:

let instructionsToEvaluate = transitiveInstructionsInfluencing(valueWeWant)
for inst in f {
  if inst in instructionsToEvaluate {
    evaluator.evaluate(inst)
  } else {
    evaluator.skip(inst)
  }
}
return evaluator.lookupConstValue(valueWeWant)

transitiveInstructionsInfluencing is going to be a bit complicated, and seems like something that the const-evaluation infrastructure should implement.

Is that something you're planning to implement later in the const-evaluation infrastructure? Or are you planning to implement something like it in your client? Or are you planning a client that uses skip in a completely different way that I haven't thought of?

marcrasi · 2019-04-19T00:11:59Z

Oh, and if the const-evaluation infrastructure does provide a correct transitiveInstructionsInfluencing function, then skip won't be necessary because evaluating all the instructions returned by transitiveInstructionsInfluencing is sufficient to get a sound value.

ravikandhadai · 2019-04-19T00:39:23Z

For example, rewriting top-level evaluation in terms of this new functionality would look something like this:

That's exactly what I had in mind, when I mentioned it as a potential application. It might just be enough if transitiveInstructionsInfluencing pulls in the data and control dependences of arguments to #assert, as a purely syntactic analysis that uses use-def chains. In some sense, doing more evaluation will only affect running time of the interpreter and not the correctness because if the evaluation fails anywhere, we can skip the instruction (this is what the helper function tryEvaluateOrElseSkip does) and if we end up finding more constants than what is needed in the #assert, we can ignore it. My intuition is that if we look at only control/data-dependences of #assert argument, it should be quite sufficient for performance, or at least as performant as the backward propagation we have now, which also uses the use-def chain and attempt to evaluate the dependences (though it gives up in some cases like multiple writers).

I see two advantages of this compared to the existing backward traversal: (1) we can simplify the interpreter and not have different code paths for flow-sensitive and top-level evaluation, (2) we can start making the interpreter iterative (instead of recursive) and make the top-level evaluation as powerful as the flow-sensitive mode e.g. by getting rid of single-writer restriction etc.

Is that something you're planning to implement later in the const-evaluation infrastructure?

Not really. This is just a thought that crossed my mind and wanted to share it. The main goal of creating this is to use it in a controlled way in the os_log optimization client (see below).

Or are you planning a client that uses skip in a completely different way that I haven't thought of?

In the client, which I am now developing (see here) for a rough implementation), we explicitly annotate functions that need to be interpreted using @_semantics annotation. We know exactly what those functions are in this case. In fact, we know where to start interpreting and where to stop, and will only evaluate calls to functions marked with that attribute and skip the rest.

ravikandhadai · 2019-04-19T00:46:14Z

Oh, and if the const-evaluation infrastructure does provide a correct transitiveInstructionsInfluencing function, then skip won't be necessary because evaluating all the instructions returned by transitiveInstructionsInfluencing is sufficient to get a sound value.

Yeah, but that requires precisely knowing all transitiveInstructionsInfluencing. This could be difficult if we have branches. If we have skip, we can allow over-approximating them using just a use-def chain. I am just mostly hand waving here. May be "skip" is not entirely necessary for this application. But, if we can eliminate the backward analysis mode and substitute it with flow-sensitive mode that seems useful (for simplifying the interpreter) without losing functionality.

ravikandhadai · 2019-04-19T01:29:58Z

An interesting example crossed my mind to illustrate my earlier thought that it is easier to have skip and a somewhat naive transitiveInstructionsInfluencing. (Interestingly, this is also the crux of the code we are trying to interpret in our client application).

    struct S {
        var const = 0
        var nonConst = 1
    } 

    func bar(x: Int) {
        var  s = S()
        var s2 = S()
        s2.nonConst += x
        s = s2
        #assert(s.const == 0)
   }

Currently, we would give up on this example as s has two writers. Furthermore, if we try to compute the transitiveInstructionsInfluencing (possibly using use-def chains) it will include s2.nonConst += x. If we only have evaluate and no skip, we will fail at s2.nonConst += x and give up. If we have skip, we can skip that instruction and continue (while being sound) and find that s.const is 0.

Edit: I changed the example as the earlier one had some errors.

marcrasi

Thanks for explaining your use case! I'm understanding better now.

I noticed that this approach gives up immediately when there is nonconstant control flow, preventing it from finding later constant values. e.g.

func foo(_ x: Bool) {
  if x {
    print("hi")
  }
  let bla = 1
  #assert(bla == 1)
}

I think it would require some nontrivial extra analysis to extend this approach to handle situations like that.

Will that be a problem for your use case?

marcrasi · 2019-04-19T03:28:59Z

include/swift/AST/DiagnosticsSIL.def

+    "branch depends on non-constant value obtained by skipping instructions",())
+NOTE(constexpr_returned_by_skip,none, "return value of a skipped instruction "
+     "is not a constant", ())
+NOTE(constexpr_mutated_by_skip,none, "value mutatable by a skipped instruction "


s/mutable/mutated/

marcrasi · 2019-04-19T03:34:21Z

include/swift/AST/DiagnosticsSIL.def

+     "is not a constant", ())
+NOTE(constexpr_mutated_by_skip,none, "value mutatable by a skipped instruction "
+    "is not a constant", ())
+


"skipped instruction" doesn't seem like a concept that should be exposed to the user.

In context of the os_log client, users might understand something like "result of an unrecognized operation" / "mutated by an unrecognized operation". What do you think about changing the diagnostic messages to those?

That right. It is not a user concept. It added this note for use in tests. It is not a part of the os_log client. Ideally, I would like to move all diagnostics to clients (like #assert client, tester client etc.) and specialize them as needed. In some sense, they are a part of the client model on how they handle errors in evaluation, as the interpreter itself is not a user-level concept, as of now.

ravikandhadai · 2019-04-19T17:36:30Z

Thanks for explaining your use case! I'm understanding better now.

I noticed that this approach gives up immediately when there is nonconstant control flow, preventing it from finding later constant values. e.g.
func foo(_ x: Bool) {
  if x {
    print("hi")
  }
  let bla = 1
  #assert(bla == 1)
}
I think it would require some nontrivial extra analysis to extend this approach to handle situations like that.

First thanks for providing these examples and the review. I think the analysis for computing relevant instruction to interpret could be a simple control/data dependent instructions (or in other words, a intraprocedural, backwards slice). But we can make the "driver" that invokes the interpreter on these relevant instructions smarter and in fact simpler :-), e.g. I am thinking of something like this:

let instructionsToEvaluate = transitiveInstructionsInfluencing(valueWeWant)
for inst in f {
  if inst in instructionsToEvaluate {
    evaluator.tryEvaluateOrSkip(inst)
     // A bit of special handling for branches.
     if `inst` is a branch, if it doesn't evaluate to a constant value, error and break
     otherwise, remove instructions in the other arm of the branch from `instructionsToEvlauate`.
  } 
}
return evaluator.lookupConstValue(valueWeWant)

We literally ignore (not even "skip") all instructions not in the backward slice. This handles your example, as we won't even look at the if condition etc. For this, we only need a guarantee that transitiveInstructionsInfluencing will include all relevant instructions. It could over-approximate them but could not leave out anything. It seems like this will be strictly more powerful than what we have currently. WDYT?

Btw, after writing this example, I think "skip" would need a better name inline with what it is doing and it is not quite "ignore". Perhaps, something like "approximateEffects" would be better?

Btw, as you know, even now we are implicitly computing some parts of this transitiveInstructionsInfluencing in singleWriterAddressValue by looking at sources of stores and recursively computing their constant value. Separating that logic out into a function like transitiveInstructionsInfluencing would be good I guess.

Will that be a problem for your use case?

That's a good question. No, the code that needs to be interpreted (which is not user-provided) does not have non-constant branches in my use case. However, these are like implicit constraints. We need to have good diagnostics there to help someone who changes the code and possibly gets stuck in such aspects as a part of that client.

marcrasi · 2019-04-20T06:39:28Z

We literally ignore (not even "skip") all instructions not in the backward slice. This handles your example, as we won't even look at the if condition etc. For this, we only need a guarantee that transitiveInstructionsInfluencing will include all relevant instructions. It could over-approximate them but could not leave out anything. It seems like this will be strictly more powerful than what we have currently. WDYT?

Makes sense, and sounds like a pretty good approach. I think the hypothetical future constexpr client won't actually want all the extra power because we want to keep the programming model simple, and allowing things like mutation in top level code (as long as it's "constant evaluable mutation") will make it harder to understand when something is const evaluable. But the elimination of the (confusing) mutual recursion sounds really good and simplifying.

Btw, after writing this example, I think "skip" would need a better name inline with what it is doing and it is not quite "ignore". Perhaps, something like "approximateEffects" would be better?

Yeah, "approximateEffects" sounds better than "skip" to me. And "tryEvaluateOrElseApproximateEffects" is very clear.

marcrasi · 2019-04-20T06:22:35Z

lib/SILOptimizer/Utils/ConstExpr.cpp

+      continue;
+    }
+    auto constVal = constValOpt.getValue();
+    if (constVal.getKind() != SymbolicValue::Address) {


Looking at this again after having just thought about #22772, I realize that this will have to be careful about aggregates containing addresses (and also addresses pointing at aggregates containing addresses).

There is no such thing as an aggregate containing an address now. Even after arrays are merged, this won't have to handle the array aggregates containing an address, because they have value semantics. (I haven't thought this through super carefully yet, but I think it's true.)

So I'm reasonably sure that no action is required now.

But if/when the constant evaluator gets reference types, this will be a problem. (Perhaps it will never get reference types during top level evaluation, exactly because of problems like this.) Is there a way to make sure we don't forget?

Actually, I remembered that reference types show up as reference types in SIL, not as addresses. So there should never be any aggregates containing addresses (except for things like arrays that are value types but that contain addresses as an implementation detail). So this will not have to worry about addresses in aggregates.

The point that it will have to worry about reference types is still valid. But we don't have reference types now.

Those are interesting observations. You are right that if the interpreter starts supporting reference types (or structs with reference types), this way of estimating effects is not right. We need to do a traversal of the reachable memory locations to correctly approximate all possible effects.

As you say Arrays and Strings are fine as they have value semantics. It does seem to me that we don't need any additional work for Arrays.

It doesn't look like "Structs with addresses" is a thing in SIL, unless we allow unsafe pointers, which can store addresses. However, allowing anything like that in the interpreted fragment does seem messy.

Is there a way to make sure we don't forget?

I think we can document code, add a test that checks that diagnostics are emitting for skip with reference types, and also add an assert to skip that will fail on any new kind of symbolic object that is not currently supported. This will force anyone who is adding a new kind of symbolic object to look at skip.

ravikandhadai · 2019-04-22T18:28:09Z

I think the hypothetical future constexpr client won't actually want all the extra power because we want to keep the programming model simple, and allowing things like mutation in top level code (as long as it's "constant evaluable mutation") will make it harder to understand when something is const evaluable. But the elimination of the (confusing) mutual recursion sounds really good and simplifying.

I completely agree with you.

ravikandhadai · 2019-04-27T02:28:38Z

Rebased and fixed the things we discussed above such as giving "skip" a better name, adding assertions and documentation to make sure that new symbolic values added in the future would require thinking about "skip" function.

ravikandhadai · 2019-04-27T02:28:55Z

@swift-ci Please smoke test

ravikandhadai · 2019-04-27T03:32:52Z

@swift-ci Please test

swift-ci · 2019-04-27T03:35:02Z

Build failed
Swift Test Linux Platform
Git Sha - 14ba86aa54f00452526be1a47a51828f8fd61dec

ravikandhadai · 2019-04-27T03:37:31Z

@swift-ci Please test Linux Platform

swift-ci · 2019-04-27T05:06:35Z

Build failed
Swift Test OS X Platform
Git Sha - 10ab00728290e04a83c0ecbae4b19227fb87cb44

ravikandhadai · 2019-04-27T17:33:12Z

@swift-ci Please test macOS Platform

ravikandhadai · 2019-04-27T17:45:59Z

@swift-ci Please test macOS Platform

swift-ci · 2019-04-27T18:29:45Z

Build failed
Swift Test OS X Platform
Git Sha - b2a56bff55de838076b66f1faf8c5d4721d4b657

instructions without evaluating them while conservatively accounting for the effects of the skipped instructions on the interpreter state.

ravikandhadai · 2019-04-30T22:56:34Z

Adding a minor enhancement: while skipping calls which are passed addresses, the contents of the addresses that are passed @in_guaranteed or @in_constant need not be reset to unknown as they cannot be mutated by the call.

ravikandhadai · 2019-04-30T22:57:02Z

@swift-ci Please test

ravikandhadai requested review from devincoughlin and marcrasi April 18, 2019 02:05

marcrasi reviewed Apr 19, 2019

View reviewed changes

marcrasi approved these changes Apr 20, 2019

View reviewed changes

ravikandhadai force-pushed the constexpr-skip branch from 10ab007 to 14ba86a Compare April 27, 2019 02:26

ravikandhadai force-pushed the constexpr-skip branch from 14ba86a to b2a56bf Compare April 27, 2019 03:12

ravikandhadai mentioned this pull request Apr 27, 2019

[SIL Optimization] Add a mandatory pass for optimizing the new os log APIs based on string interpolation. #24336

Merged

[Const evaluator] Enable stepwise constant evaluator to skip

b0e56f7

instructions without evaluating them while conservatively accounting for the effects of the skipped instructions on the interpreter state.

ravikandhadai force-pushed the constexpr-skip branch from b2a56bf to b0e56f7 Compare April 30, 2019 22:53

ravikandhadai merged commit d912e33 into swiftlang:master May 2, 2019

[Const evaluator] Add support to "skip" instructions in step-wise evaluation #24113

[Const evaluator] Add support to "skip" instructions in step-wise evaluation #24113

Uh oh!

Conversation

ravikandhadai commented Apr 18, 2019

Uh oh!

ravikandhadai commented Apr 18, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ravikandhadai commented Apr 18, 2019

Uh oh!

ravikandhadai commented Apr 18, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ravikandhadai commented Apr 18, 2019

Uh oh!

ravikandhadai commented Apr 18, 2019

Uh oh!

marcrasi commented Apr 19, 2019

Uh oh!

marcrasi commented Apr 19, 2019

Uh oh!

ravikandhadai commented Apr 19, 2019

Uh oh!

ravikandhadai commented Apr 19, 2019

Uh oh!

ravikandhadai commented Apr 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marcrasi left a comment

Choose a reason for hiding this comment

Uh oh!

marcrasi Apr 19, 2019

Choose a reason for hiding this comment

Uh oh!

marcrasi Apr 19, 2019

Choose a reason for hiding this comment

Uh oh!

ravikandhadai Apr 19, 2019

Choose a reason for hiding this comment

Uh oh!

ravikandhadai commented Apr 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marcrasi commented Apr 20, 2019

Uh oh!

marcrasi Apr 20, 2019

Choose a reason for hiding this comment

Uh oh!

marcrasi Apr 22, 2019

Choose a reason for hiding this comment

Uh oh!

ravikandhadai Apr 22, 2019

Choose a reason for hiding this comment

Uh oh!

ravikandhadai commented Apr 22, 2019

Uh oh!

ravikandhadai commented Apr 27, 2019

Uh oh!

ravikandhadai commented Apr 27, 2019

Uh oh!

ravikandhadai commented Apr 27, 2019

Uh oh!

swift-ci commented Apr 27, 2019

Uh oh!

ravikandhadai commented Apr 27, 2019

Uh oh!

swift-ci commented Apr 27, 2019

Uh oh!

ravikandhadai commented Apr 27, 2019

Uh oh!

ravikandhadai commented Apr 27, 2019

Uh oh!

swift-ci commented Apr 27, 2019

Uh oh!

ravikandhadai commented Apr 30, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ravikandhadai commented Apr 30, 2019

Uh oh!

ravikandhadai commented Apr 18, 2019 •

edited

Loading

ravikandhadai commented Apr 18, 2019 •

edited

Loading

ravikandhadai commented Apr 19, 2019 •

edited

Loading

ravikandhadai commented Apr 19, 2019 •

edited

Loading

ravikandhadai commented Apr 30, 2019 •

edited

Loading