Update `min(_:_:)` and `max(_:_:)` gradients to match Python TensorFlow #480

jon-tow · 2019-08-24T00:54:42Z

Fixes #479

Tests/TensorFlowTests/TensorAutoDiffTests.swift

Sources/TensorFlow/Operators/Math.swift

jon-tow · 2019-08-24T02:48:32Z

@dan-zheng @rxwei This should be ready.

dan-zheng

Thank you for the prompt fix! 🙂

Sources/TensorFlow/Operators/Math.swift

dan-zheng · 2019-08-24T08:03:50Z

There appears to be one failure:

Test Case 'LossTests.testSigmoidCrossEntropyGrad' started at 2019-08-24 07:41:17.269
/swift-apis/Tests/TensorFlowTests/LossTests.swift:219: error: LossTests.testSigmoidCrossEntropyGrad : XCTAssertEqual failed: ("0.125") is not equal to ("0.0625") +/- ("1e-06") -

It's likely because sigmoidCrossEntropy internally calls max:

@differentiable(wrt: logits)
public func sigmoidCrossEntropy<Scalar: TensorFlowFloatingPoint>(
    logits: Tensor<Scalar>,
    labels: Tensor<Scalar>,
    reduction: @differentiable (Tensor<Scalar>) -> Tensor<Scalar> = _mean
) -> Tensor<Scalar> {
    let maxLogitsWithZero = max(logits, Tensor(0)) // `max` called here
    let result = log(1 + exp(-abs(logits)))
    return reduction(maxLogitsWithZero - logits * labels + result)
}

Could you please fix? Updating the expected value in the test should be good - validating the expected value against tf.losses.sigmoid_cross_entropy would be great (I'm not sure how expectedGradientsBeforeMean was computed in the test).

jon-tow · 2019-08-24T19:46:50Z

tf.losses.sigmoid_cross_entropy is computed via tf.python.ops.nn_impl.sigmoid_cross_entropy_with_logits. which is a bit different than the swift-api version in that it uses custom max and abs in order to compute gradients at 0. I'm still trying to find my way around this but will update as soon as possible.

Sources/TensorFlow/Loss.swift

Sources/TensorFlow/Operators/Math.swift

Tests/TensorFlowTests/TensorAutoDiffTests.swift

dan-zheng

Thanks @jon-tow for the many adjustments!

jon-tow · 2019-08-26T07:24:56Z

No problem @dan-zheng. Thanks for the guidance!

Update min(_:_:) and max(_:_:) gradients to match Python TensorFlow

7316275

rxwei reviewed Aug 24, 2019

View reviewed changes

Tests/TensorFlowTests/TensorAutoDiffTests.swift Show resolved Hide resolved

dan-zheng reviewed Aug 24, 2019

View reviewed changes

Sources/TensorFlow/Operators/Math.swift Outdated Show resolved Hide resolved

jon-tow added 3 commits August 23, 2019 21:37

Update comments and cleanup

4562cca

Fix block comments

83a1474

Apply Dan's optimization

b92dffd

dan-zheng approved these changes Aug 24, 2019

View reviewed changes

dan-zheng reviewed Aug 24, 2019

View reviewed changes

Sources/TensorFlow/Operators/Math.swift Outdated Show resolved Hide resolved

Re-format return values

3bbc384

rxwei added the kokoro:run label Aug 24, 2019

kokoro-team removed the kokoro:run label Aug 24, 2019

rxwei approved these changes Aug 24, 2019

View reviewed changes

Sources/TensorFlow/Operators/Math.swift Outdated Show resolved Hide resolved

Initialize mask with Tensor casting

89f74de

jon-tow added 2 commits August 24, 2019 21:27

Update sigmoidCrossEntropy and its gradient tests

b0b5123

Update python reference script for consistency

3898baf

rxwei added the kokoro:run label Aug 25, 2019

kokoro-team removed the kokoro:run label Aug 25, 2019

dan-zheng reviewed Aug 25, 2019

View reviewed changes

Sources/TensorFlow/Loss.swift Show resolved Hide resolved

eaplatanios reviewed Aug 25, 2019

View reviewed changes

Sources/TensorFlow/Operators/Math.swift Outdated Show resolved Hide resolved

Update min/max rhs argument gradients and tests

3852370

dan-zheng reviewed Aug 26, 2019

View reviewed changes

Tests/TensorFlowTests/TensorAutoDiffTests.swift Outdated Show resolved Hide resolved

Update variables to be immutable and use do statements

7eba493

dan-zheng approved these changes Aug 26, 2019

View reviewed changes

dan-zheng merged commit b7f2a06 into tensorflow:master Aug 26, 2019

jon-tow deleted the gradient/min-max branch August 26, 2019 21:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update `min(_:_:)` and `max(_:_:)` gradients to match Python TensorFlow #480

Update `min(_:_:)` and `max(_:_:)` gradients to match Python TensorFlow #480

Uh oh!

jon-tow commented Aug 24, 2019 •

edited by rxwei

Loading

Uh oh!

Uh oh!

Uh oh!

jon-tow commented Aug 24, 2019

Uh oh!

dan-zheng left a comment

Uh oh!

Uh oh!

Uh oh!

dan-zheng commented Aug 24, 2019 •

edited

Loading

Uh oh!

jon-tow commented Aug 24, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dan-zheng left a comment

Uh oh!

jon-tow commented Aug 26, 2019

Uh oh!

Uh oh!

Update min(_:_:) and max(_:_:) gradients to match Python TensorFlow #480

Update min(_:_:) and max(_:_:) gradients to match Python TensorFlow #480

Uh oh!

Conversation

jon-tow commented Aug 24, 2019 • edited by rxwei Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jon-tow commented Aug 24, 2019

Uh oh!

dan-zheng left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dan-zheng commented Aug 24, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jon-tow commented Aug 24, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dan-zheng left a comment

Choose a reason for hiding this comment

Uh oh!

jon-tow commented Aug 26, 2019

Uh oh!

Uh oh!

Update `min(_:_:)` and `max(_:_:)` gradients to match Python TensorFlow #480

Update `min(_:_:)` and `max(_:_:)` gradients to match Python TensorFlow #480

jon-tow commented Aug 24, 2019 •

edited by rxwei

Loading

dan-zheng commented Aug 24, 2019 •

edited

Loading