Batchnorm changes: fix axis handling and drop workaround for AD crasher #1

jekbradbury · 2019-02-12T04:38:47Z

The axis argument in the batch normalization layers in tf.keras and tf.layers refers to the second of the two axes that should be normalized over (see https://fenghz.github.io/images/2018-4-15/Batch_Norm_Picture.png), and defaults to the last axis (as it typically represents channels), while the first axis is always 0. We should match that semantics. We can also drop an AD workaround, enabling correct inference behavior.

Batchnorm changes: fix axis handling and drop workaround for AD crasher

6b0cf58

jekbradbury requested a review from rxwei February 12, 2019 04:38

rxwei approved these changes Feb 12, 2019

View reviewed changes

jekbradbury merged commit 93d8ea5 into tensorflow:master Feb 12, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Batchnorm changes: fix axis handling and drop workaround for AD crasher #1

Batchnorm changes: fix axis handling and drop workaround for AD crasher #1

Uh oh!

jekbradbury commented Feb 12, 2019

Uh oh!

Uh oh!

Batchnorm changes: fix axis handling and drop workaround for AD crasher #1

Batchnorm changes: fix axis handling and drop workaround for AD crasher #1

Uh oh!

Conversation

jekbradbury commented Feb 12, 2019

Uh oh!

Uh oh!