Fix derivative of `RNN.callAsFunction(_:initialState:)`. #660

dan-zheng · 2020-02-03T23:32:21Z

Previously, RNN._vjpcallAsFunction(_:initialState:) incorrectly used a zero
initial state. Now, it uses initialState as the initial state.

Add RNN gradient tests for SimpleRNNCell, LSTMCell, and GRUCell.
Todo: verify that gradients are correct using a reference implementation.

Previously, `RNN._vjpcallAsFunction(_:initialState:)` incorrectly used a zero initial state. Now, it uses `initialState` as the initial state. Add `RNN` gradient tests for `SimpleRNNCell`, `LSTMCell`, and `GRUCell`. Todo: verify that gradients are correct using a reference implementation.

marcrasi · 2020-02-03T23:52:43Z

Tests/TensorFlowTests/LayerTests.swift

@@ -1190,22 +1189,62 @@ final class LayerTests: XCTestCase {
                     [ 0.074910110, 0.021107012, -0.049724963, -0.069670826],
                     [ 0.078670055, 0.022462710, -0.051899005, -0.075331904]],
                    accuracy: 1e-6)
+                let (𝛁lstm, _) = pullback(.init(inputs.map { LSTMCell<Float>.State(cell: $0, hidden: $0) }))
+                // TODO: Verify that LSTM gradients are correct using a reference implementation.


I'm interested in doing this. Should we do something like check some TF python code into this repository that reproduces the gradients?

That sounds good!

dan-zheng requested a review from marcrasi February 3, 2020 23:32

marcrasi approved these changes Feb 3, 2020

View reviewed changes

marcrasi reviewed Feb 3, 2020

View reviewed changes

dan-zheng merged commit 94ab4f5 into tensorflow:master Feb 4, 2020

dan-zheng deleted the fix-rnn-derivative branch February 4, 2020 00:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix derivative of `RNN.callAsFunction(_:initialState:)`. #660

Fix derivative of `RNN.callAsFunction(_:initialState:)`. #660

Uh oh!

dan-zheng commented Feb 3, 2020

Uh oh!

marcrasi Feb 3, 2020 •

edited

Loading

Uh oh!

dan-zheng Feb 4, 2020

Uh oh!

Uh oh!

Fix derivative of RNN.callAsFunction(_:initialState:). #660

Fix derivative of RNN.callAsFunction(_:initialState:). #660

Uh oh!

Conversation

dan-zheng commented Feb 3, 2020

Uh oh!

marcrasi Feb 3, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dan-zheng Feb 4, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Fix derivative of `RNN.callAsFunction(_:initialState:)`. #660

Fix derivative of `RNN.callAsFunction(_:initialState:)`. #660

marcrasi Feb 3, 2020 •

edited

Loading