Skip to content
This repository was archived by the owner on Jul 1, 2023. It is now read-only.

Batchnorm changes: fix axis handling and drop workaround for AD crasher #1

Merged
merged 1 commit into from
Feb 12, 2019

Conversation

jekbradbury
Copy link
Contributor

The axis argument in the batch normalization layers in tf.keras and tf.layers refers to the second of the two axes that should be normalized over (see https://fenghz.github.io/images/2018-4-15/Batch_Norm_Picture.png), and defaults to the last axis (as it typically represents channels), while the first axis is always 0. We should match that semantics. We can also drop an AD workaround, enabling correct inference behavior.

@jekbradbury jekbradbury requested a review from rxwei February 12, 2019 04:38
@jekbradbury jekbradbury merged commit 93d8ea5 into tensorflow:master Feb 12, 2019
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants