[AutoDiff] Add more Tensor `broadcast`/`unbroadcast` differentiation tests. #24899

bartchr808 · 2019-05-19T07:27:56Z

Similar to the already existing set of tests for broadcast(toShape:)/unbroadcast(toShape:) in that this adds the same type of tests, but calling broadcast(to:)/unbroadcast(to:) and broadcast(like:)/unbroadcast(like:) instead.

bartchr808 · 2019-05-19T07:28:52Z

@swift-ci please test tensorflow

bartchr808 · 2019-05-19T07:29:12Z

@swift-ci please test tensorflow

…rmance. The inefficiency of `unbroadcast(toShape:)`, `unbroadcast(to:)`, and `unbroadcast(like:)` has caused significant performance problems during model training because it's performing a lot of TensorFlow operations to achieve axis calculation. We were forced to implement it this way in the early GPE era when neither send/receive nor per-op dispatch was available. This PR reimplements the unbroadcast operations in terms of host-side logic to compute axes to reduce along. This significantly reduces the TensorFlow opreation dispatch overhead. The base implementation changed from `broadcast(toShape:)` to `broadcast(to:)`. With the new implementation, differentiating broadcasting operators is 37% faster (see simple test script [here](https://gist.github.com/rxwei/e1488cac5379ba2bc3aff7490e18158f)). Note: - Since we now rely on the TensorFlow runtime less, more precondition checks and assertions are added to the newly implemented `unbroadcast(to:)` method. - The part of swiftlang#24408 that uses `Raw.broadcastGradientArgs(s0:s1:)` is still necessary for broadcasting binary operations to become faster. TODO: - Change `unbroadcast(toShape:)` tests added by swiftlang#24899 to use `unbroadcast(to:)`, since `unbroadcast(to:)` is now the base implementation.

…rmance. (#24907) The inefficiency of `unbroadcast(toShape:)`, `unbroadcast(to:)`, and `unbroadcast(like:)` has caused significant performance problems during model training because it's performing a lot of TensorFlow operations to achieve axis calculation. We were forced to implement it this way in the early GPE era when neither send/receive nor per-op dispatch was available. This PR reimplements the unbroadcast operations in terms of host-side logic to compute axes to reduce along. This significantly reduces the TensorFlow opreation dispatch overhead. The base implementation changed from `broadcast(toShape:)` to `broadcast(to:)`. With the new implementation, differentiating broadcasting operators is 37% faster (see simple test script [here](https://gist.github.com/rxwei/e1488cac5379ba2bc3aff7490e18158f)). Note: - Since we now rely on the TensorFlow runtime less, more precondition checks and assertions are added to the newly implemented `unbroadcast(to:)` method. - The part of #24408 that uses `Raw.broadcastGradientArgs(s0:s1:)` is still necessary for broadcasting binary operations to become faster. TODO: - Change `unbroadcast(toShape:)` tests added by #24899 to use `unbroadcast(to:)`, since `unbroadcast(to:)` is now the base implementation.

bartchr808 · 2019-06-13T21:42:31Z

Closing PR due to refactoring moving Tensor to tensorflow/swift-apis found in this PR.

bartchr808 added 6 commits May 17, 2019 09:44

Add all VJP functions, need to write tests.

8c9f88d

PR feedback batch #1.

af4859a

Use closure call to remove VJPs

105089b

Start adding tests (un)broadcast(toShape:).

e5eceff

Add tests to (un)broadcast(to/like:).

74e1129

Merge branch 'tensorflow' into TF-509-tensor-broadcast-differentiable

48d6129

bartchr808 added the tensorflow This is for "tensorflow" branch PRs. label May 19, 2019

bartchr808 requested review from rxwei and dan-zheng May 19, 2019 07:27

Move around tests b/w Tensor and AutoDiff.

68cc56a

rxwei mentioned this pull request May 20, 2019

[TF] Reimplement unbroadcast using on-host axis calculation for performance. #24907

Merged

Merge branch 'tensorflow' into TF-509-tensor-broadcast-differentiable

cf02cfc

bartchr808 mentioned this pull request Jun 13, 2019

Port over tensor_autodiff_runtime.swift tests. tensorflow/swift-apis#235

Merged

bartchr808 closed this Jun 13, 2019

bartchr808 deleted the TF-509-tensor-broadcast-differentiable branch June 13, 2019 21:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AutoDiff] Add more Tensor `broadcast`/`unbroadcast` differentiation tests. #24899

[AutoDiff] Add more Tensor `broadcast`/`unbroadcast` differentiation tests. #24899

Uh oh!

bartchr808 commented May 19, 2019

Uh oh!

bartchr808 commented May 19, 2019

Uh oh!

bartchr808 commented May 19, 2019

Uh oh!

bartchr808 commented Jun 13, 2019

Uh oh!

Uh oh!

[AutoDiff] Add more Tensor broadcast/unbroadcast differentiation tests. #24899

[AutoDiff] Add more Tensor broadcast/unbroadcast differentiation tests. #24899

Uh oh!

Conversation

bartchr808 commented May 19, 2019

Uh oh!

bartchr808 commented May 19, 2019

Uh oh!

bartchr808 commented May 19, 2019

Uh oh!

bartchr808 commented Jun 13, 2019

Uh oh!

Uh oh!

[AutoDiff] Add more Tensor `broadcast`/`unbroadcast` differentiation tests. #24899

[AutoDiff] Add more Tensor `broadcast`/`unbroadcast` differentiation tests. #24899