This repository was archived by the owner on Jul 1, 2023. It is now read-only.
use .moments() in LayerNorm and BatchNorm layers #384
Merged
Mean and variance in the layers are now calculated using `Tensor.moments()`. I also added tests for both the BatchNorm and LayerNorm layers. The tests turned up a flaw in the shapes of `scale` and `offset`, which were always `[featureCount]`
irrespective of the input shape or the axis for normalisation. That shape leads to incorrect broadcasting when the axis being normalised along is not the last axis. I have fixed this by always reshaping `scale` and `offset` before they are used. This seems hacky in that I get the shapes from the calculated `mean` and `variance`
. Without the input shape being known at initialization time, though, I couldn't see a better way to do this.

I think the axis argument is probably there for consistency with Keras, but most of the Swift API layers assume inputs and activations are NHWC. So requiring NHWC, eliminating the axis argument, and setting the correct shapes in `init()` would be another option.
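To make the broadcasting flaw concrete, here is a NumPy sketch of the same situation (not the Swift API itself; the shapes, the epsilon, and the variable names are made up for illustration). A `[featureCount]`-shaped parameter broadcasts against the *last* axis, so it misaligns (or errors out) when normalising along an earlier axis; reshaping it to put `featureCount` on the normalisation axis, as the PR does using the shape of the computed mean/variance, fixes it:

```python
import numpy as np

# Hypothetical input in an NCHW-like layout: normalise along axis 1
# (featureCount = 3), which is NOT the last axis.
x = np.random.randn(2, 3, 4, 4)
axis = 1
feature_count = x.shape[axis]

mean = x.mean(axis=axis, keepdims=True)     # shape (2, 1, 4, 4)
variance = x.var(axis=axis, keepdims=True)  # shape (2, 1, 4, 4)

scale = np.ones(feature_count)    # shape (3,) -- the flawed parameter shape
offset = np.zeros(feature_count)

# Flaw: a (3,)-shaped array aligns with the trailing axis (size 4), not the
# feature axis. Here that raises; if the trailing axis happened to also be
# size 3, it would silently compute the wrong result instead.
try:
    y_wrong = scale * (x - mean) / np.sqrt(variance + 1e-5) + offset
except ValueError as e:
    print("broadcasting failed:", e)

# Fix: reshape scale/offset so featureCount sits on the normalisation axis,
# mirroring the PR's trick of taking the shape from mean/variance.
shape = [1] * x.ndim
shape[axis] = feature_count
y = (scale.reshape(shape) * (x - mean) / np.sqrt(variance + 1e-5)
     + offset.reshape(shape))
print(y.shape)  # (2, 3, 4, 4)
```

The same reshape-to-`[1, featureCount, 1, 1]` idea is what the PR applies to `scale` and `offset` before use.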