[AutoDiff] devirtualize diff witnesses #28480


Merged
merged 6 commits · Nov 26, 2019

Conversation


@marcrasi marcrasi commented Nov 26, 2019

Adds an optimization pass that devirtualizes references to differentiability witnesses in the functions that use them, replacing differentiability_witness_function instructions with function_ref instructions.

Resolves TF-919 and TF-994.

Performance impact

This completely eliminates the performance impact of #28451 under -O (except for cross-module non-serialized differentiability witnesses) by causing the compiler to generate the same code that tensorflow HEAD generates.

I confirmed this by measuring the microbenchmark posted at #28451 (comment). I haven't experimentally confirmed that the Google-internal model is also fixed, but I will experimentally verify that before I merge #28451.

@marcrasi marcrasi requested review from rxwei and dan-zheng November 26, 2019 01:34
@marcrasi
Author

@swift-ci please test tensorflow

Contributor

@dan-zheng dan-zheng left a comment


LGTM!

I like your point that "devirtualization" isn't an apt name because there's no virtual dispatch.
Running the pass only with -O sounds good.


bool DifferentiabilityWitnessInliner::
    inlineDifferentiabilityWitnessesInFunction(SILFunction &F) {
  bool Changed = false;
Contributor


Minor: how about consistently using camelCase spelling for variables in differentiable programming code? I don't feel too strongly, as long as casing is locally consistent.

Author


done

auto *W = I->getWitness();
if (W->isDeclaration() && !F.getModule().loadDifferentiabilityWitness(W))
  continue;
assert(W->isDefinition());
Contributor


Did you mean to set Changed to true here?

Author


done

    SILMod, *linkage, original, parameterIndices, resultIndices,
    derivativeGenSig, jvp, vjp, isSerialized);
diffWitnessOrOffset.set(diffWitness, /*isFullyDeserialized*/ true);
if (diffWitness->isDeclaration() && !isDeclaration)
Contributor


Could you please explain when this condition is true and convertToDefinition is called?
It doesn't seem wholly obvious; perhaps an explanatory comment would be good.

Author


done

@dan-zheng
Contributor

Could you please comment on how this patch impacts -O runtime performance?
Does it eliminate performance differences with current tensorflow HEAD?

@rxwei
Contributor

rxwei commented Nov 26, 2019

> I named this "inliner" instead of "devirtualizer" because there is no runtime dynamism involved, which makes this seem more like inlining than devirtualization.

There is runtime dynamism. The way it works is like protocol witness methods: there's no virtual dispatch, but there is dispatch -- the derivative is fetched at runtime. SIL devirtualizer applies to witness methods even though there's no virtual dispatch, so I think "devirtualizer" totally applies to differentiability witness functions.

Contributor

@rxwei rxwei left a comment


Given the similarity to the SIL devirtualizer, maybe it's easier to just add some logic in swift::tryDevirtualizeApply that handles DifferentiabilityWitnessFunction instructions.

@marcrasi marcrasi changed the title [AutoDiff] inline diff witnesses [AutoDiff] devirtualize diff witnesses Nov 26, 2019
@marcrasi
Author

> The way it works is like protocol witness methods: there's no virtual dispatch, but there is dispatch -- the derivative is fetched at runtime. SIL devirtualizer applies to witness methods even though there's no virtual dispatch, so I think "devirtualizer" totally applies to differentiability witness functions.

This sounds good; I will rename it to "devirtualizer".

> Given the similarity to the SIL devirtualizer, maybe it's easier to just add some logic in swift::tryDevirtualizeApply that handles DifferentiabilityWitnessFunction instructions.

Currently this would be pretty involved because the SIL devirtualizer starts at apply instructions and looks for the callee. A lot of the callees in differentiation cases are hidden behind differentiable_function and differentiable_function_extract instructions.

The pass as written handles the devirtualization at the reference sites, avoiding this difficulty.

> Could you please comment on how this patch impacts -O runtime performance?
> Does it eliminate performance differences with current tensorflow HEAD?

In combination with #28451, -O performance is the same as tensorflow HEAD in the cases that I tested. I will add a note about this to the PR description.

@rxwei
Contributor

rxwei commented Nov 26, 2019

> Currently this would be pretty involved because the SIL devirtualizer starts at apply instructions and looks for the callee. A lot of the callees in differentiation cases are hidden behind differentiable_function and differentiable_function_extract instructions.

Makes sense. We need to do a proper differentiable_function-differentiable_function_extract folding pass at some point, after which we can revisit making tryDevirtualizeApply handle differentiability witness functions.

@marcrasi
Author

@swift-ci please test tensorflow

2 similar comments

@marcrasi
Author

@swift-ci please test tensorflow

@marcrasi
Author

@swift-ci please test tensorflow

@marcrasi
Author

@swift-ci please test tensorflow macos

@marcrasi
Author

@swift-ci please test tensorflow

3 similar comments

@marcrasi
Author

@swift-ci please test tensorflow

@marcrasi
Author

@swift-ci please test tensorflow

@marcrasi
Author

@swift-ci please test tensorflow

@rxwei
Contributor

rxwei commented Nov 26, 2019

@swift-ci please test tensorflow

2 similar comments

@rxwei
Contributor

rxwei commented Nov 26, 2019

@swift-ci please test tensorflow

@rxwei
Contributor

rxwei commented Nov 26, 2019

@swift-ci please test tensorflow

@marcrasi
Author

@swift-ci please test tensorflow macos

1 similar comment

@marcrasi
Author

@swift-ci please test tensorflow macos
