Fix Transposed Conv2d error & add test #288

Shashi456 · 2019-06-24T18:28:05Z

Fix: #282
This is a fairly long error report.
The transposed conv2d layer doesn't work, and a number of issues have come up. The current output-size computation is:

        let w = (input.shape[1] - (1 * paddingIndex)) *
          strides.0 + (filter.shape[0] * paddingIndex)
        let h = (input.shape[2] - (1 * paddingIndex)) *
          strides.1 + (filter.shape[1] * paddingIndex)

The first error I found was in this logic. Keras computes the new dimensions as follows:

    assert padding in {'same', 'valid', 'full'}
    if dim_size is None:
        return None

    # Get the dilated kernel size
    kernel_size = (kernel_size - 1) * dilation + 1

    # Infer length if output padding is None, else compute the exact length
    if output_padding is None:
        if padding == 'valid':
            dim_size = dim_size * stride_size + max(kernel_size - stride_size, 0)
        elif padding == 'full':
            dim_size = dim_size * stride_size - (stride_size + kernel_size - 2)
        elif padding == 'same':
            dim_size = dim_size * stride_size
    else:
        if padding == 'same':
            pad = kernel_size // 2
        elif padding == 'valid':
            pad = 0
        elif padding == 'full':
            pad = kernel_size - 1

        dim_size = ((dim_size - 1) * stride_size + kernel_size - 2 * pad +
                    output_padding)

    return dim_size

So I changed the Swift code to reflect this:

        let w = (input.shape[1] - 1) * 
          strides.0 + (filter.shape[0] * paddingIndex)
        let h = (input.shape[2] - 1) *
          strides.1 + (filter.shape[1] * paddingIndex)

Then I found that running this test in TensorFlow produces the output shown in the comments:

from functools import reduce
import operator
import tensorflow as tf

def product(iterable):
    return reduce(operator.mul, iterable, 1)

# Returns a tensor with increasing scalar values starting from zero,
# with the given shape.
def iota(shape):
    x = tf.range(0, product(shape), dtype=tf.float32)
    return tf.reshape(x, shape)

input_shape = [1,4,2,1]
filter_shape = [4,2,1,1]
strides = (1, 1, 1, 1)
input = iota(input_shape)
filter = iota(filter_shape)
conv3d = tf.nn.conv2d_transpose(input, filter, output_shape=(1,4,2,1), strides=strides, padding='SAME')
conv3 = tf.nn.bias_add(conv3d, [8])


with tf.Session() as session:
  result = session.run(conv3)
  print(result.shape)
  # (1, 4, 2, 1)
  print(result)
#   [[[[  8.]
#    [ 12.]]

#   [[ 12.]
#    [ 28.]]

#   [[ 24.]
#    [ 64.]]

#   [[ 48.]
#    [112.]]]]

But when I ran it in Swift with .same padding, I got this output:

("[[[[ 93.0],
   [ 52.0]],

  [[148.0],
   [ 76.0]],

  [[ 93.0],
   [ 46.0]],

  [[ 46.0],
   [ 22.0]]]]")

One thing I've come to understand is that the TensorFlow conv2d_transpose is just a wrapper that Keras calls. You can check the code for it here.

So running the function directly might be a cheap hack. The Keras transposed conv2d is where the new dimensions are actually computed; the code for it can be found here, and the backend for that here.

I'm very confused by the code right now, but I think I'm making a major mistake somewhere while translating it. I would appreciate some help. Sorry for the very long introductory message.

Shashi456 · 2019-06-24T18:33:20Z

We don't currently support output padding, so the logic falls back to the default value, which is zero. We also don't support full padding; that would possibly need control flow.

jekbradbury · 2019-06-25T00:50:16Z

This LGTM assuming the test case comes from a comparison with an existing framework (e.g. Keras).

Shashi456 · 2019-06-25T00:51:17Z

@jekbradbury, surprisingly doesn't work though

jekbradbury · 2019-06-25T00:51:57Z

Ohh, as in the test doesn't pass?

Shashi456 · 2019-06-25T00:52:20Z

Yea they don't.

jekbradbury · 2019-06-25T00:59:06Z

Does it give [22, 46, 46, 93, 76, 148, 52, 93] instead?

Shashi456 · 2019-06-25T01:02:20Z

@jekbradbury it gives

("[[[[ 93.0],
   [ 52.0]],

  [[148.0],
   [ 76.0]],

  [[ 93.0],
   [ 46.0]],

  [[ 46.0],
   [ 22.0]]]]

jekbradbury · 2019-06-25T01:24:26Z

OK, so what's happening is that some frameworks define transposed convolution in different (but almost equivalent) ways, and you can switch between these definitions by reflecting the filter over both spatial axes and swapping the input and output channel dimensions. (The reason for this is that one of these definitions is the standard mathematical definition for a transposed convolution, and the other allows the framework to reuse the same kernels as the normal convolution backwards pass).

I'm not quite sure how you're getting the numbers you just pasted, but they're the correct result for the "standard mathematical definition" (although flipped) and the expected numbers in the test are the correct result for the "backwards pass of convolution" definition that's used by TF/Keras. I'm confused as to why the test currently includes Conv2D<Float>; I assume you mean TransposedConv2D<Float>.
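Both sets of numbers in this thread can be checked by hand. The following is a minimal pure-Python sketch, not the TensorFlow or Swift implementation: it assumes a single channel, stride 1, and TensorFlow's SAME padding convention (when the total padding is odd, the extra row or column goes at the bottom or right). The helper names are hypothetical. Under those assumptions, the "backwards pass of convolution" definition reproduces the expected values in the test, and a plain forward convolution reproduces the numbers reported from Swift, which would be consistent with the test constructing Conv2D rather than TransposedConv2D.

```python
def iota(rows, cols):
    """Row-major grid of increasing values starting at 0."""
    return [[float(r * cols + c) for c in range(cols)] for r in range(rows)]


def pad_before(kernel):
    """Leading SAME padding for stride 1 (assumed TensorFlow convention)."""
    return (kernel - 1) // 2


def conv2d_same(x, f, bias):
    """Forward cross-correlation: stride 1, SAME padding, one channel."""
    h, w, kh, kw = len(x), len(x[0]), len(f), len(f[0])
    top, left = pad_before(kh), pad_before(kw)
    out = [[bias] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            for a in range(kh):
                for b in range(kw):
                    r, c = i + a - top, j + b - left
                    if 0 <= r < h and 0 <= c < w:
                        # Output (i, j) gathers from input (r, c).
                        out[i][j] += x[r][c] * f[a][b]
    return out


def conv2d_transpose_same(x, f, bias):
    """Gradient of conv2d_same with respect to its input, applied to x."""
    h, w, kh, kw = len(x), len(x[0]), len(f), len(f[0])
    top, left = pad_before(kh), pad_before(kw)
    out = [[bias] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            for a in range(kh):
                for b in range(kw):
                    r, c = i + a - top, j + b - left
                    if 0 <= r < h and 0 <= c < w:
                        # Input (i, j) scatters into output (r, c).
                        out[r][c] += x[i][j] * f[a][b]
    return out


def flatten(grid):
    return [value for row in grid for value in row]


x = iota(4, 2)  # same values as the [1, 4, 2, 1] input in the test
f = iota(4, 2)  # same values as the [4, 2, 1, 1] filter in the test
print(flatten(conv2d_transpose_same(x, f, 8.0)))
# [8.0, 12.0, 12.0, 28.0, 24.0, 64.0, 48.0, 112.0]
print(flatten(conv2d_same(x, f, 8.0)))
# [93.0, 52.0, 148.0, 76.0, 93.0, 46.0, 46.0, 22.0]
```

The two functions differ only in which side of the index mapping is written to: the forward pass gathers into each output position, while the transposed version scatters each input element through the same filter taps.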

jekbradbury · 2019-06-25T01:26:05Z

I'm particularly confused because conv2dBackpropInput should give the "backwards pass of convolution" result (which is exactly what it sounds like) and it's also what Keras uses.

Shashi456 · 2019-06-25T04:14:20Z

@jekbradbury I've not seen any direct usage of the TensorFlow transposed conv2d, though. It's mostly the Keras API that is used, which in turn ultimately calls the TensorFlow API. What I'm unsure about is that, given the inputs I have, the output shape I compute manually differs from the output I'm getting.

Also since you mentioned it, would you suggest any changes for the current order of parameters that we use?


Shashi456 · 2019-06-25T06:15:34Z

The current error for the test is now:

Fatal error: Conv2DCustomBackpropInput: Size of out_backprop doesn't match computed: 
actual = 4, computed = 3 spatial_dim: 1 input: 3 filter: 4 output: 4 stride: 1 dilation: 1:

I think the test I wrote is wrong, because the code I wrote directly calls tf.nn.conv2d_transpose, which just calls conv2DBackpropInput, while it's in the Keras API that the dimensions of the output are actually calculated.

Sources/TensorFlow/Layers/Convolutional.swift

Shashi456 · 2019-06-26T07:34:48Z

@jekbradbury Do you have any idea as to how we could solve the latest error?

marcrasi · 2019-06-27T19:30:11Z

I haven't carefully read this whole discussion, but https://bugs.swift.org/browse/TF-540 and this thread might be related: https://groups.google.com/a/tensorflow.org/forum/#!msg/swift/UUPwV01sZrE/LszG6T7dBQAJ ?

Shashi456 · 2019-06-27T19:34:06Z

@marcrasi that was an extension error. There seems to be some error within the layer implementation as well, or it's the test. I'm just trying to figure it out currently. That thread did help me earlier to figure out a few errors :) Thanks.

sjaz24 · 2019-07-01T03:45:19Z

If you revert to the original code, that is, put back "- (1 * paddingIndex)" instead of always subtracting 1, then your test should work. It works for me. However, I get an error if I attempt to get gradients. I'm not sure whether that should work or not.

    let filter = Tensor(shape: [4, 2, 1, 1], 
                        scalars: (0..<8).map(Float.init))
    let bias = Tensor<Float>([8])
    let layer = TransposedConv2D(filter: filter, 
                                 bias: bias, 
                                 activation: identity,
                                 strides: (1, 1), 
                                 padding: .same)
    let input = Tensor(shape: [1, 4, 2, 1], 
                       scalars: (0..<8).map(Float.init))
    let output = layer.inferring(from: input)
    let expected = Tensor<Float>(shape: [1, 4, 2, 1],
                                 scalars: [8, 12, 12, 28, 24, 64, 48, 112])
    print(output == expected)
    /* this outputs true. it works until this point */
   
    /* this fails */
    let (loss, grads) = layer.valueWithGradient { layer -> Tensor<Float> in 
        return layer(input).sum()
    }
    /* the following error occurs:
       Fatal error: Conv2DCustomBackpropFilter: input depth must be evenly divisible by filter depth: file /Users/danielzheng/swift-tf/tensorflow-swift-apis/Sources/TensorFlow/Bindings/EagerExecution.swift, line 299 Illegal instruction: 4
     */

sjaz24 · 2019-07-01T03:57:41Z

Also, I'm not sure whether this is an issue, but TransposedConv2D documents its tensors as width then height, whereas the underlying Raw.conv2DBackpropInput documents them as height then width. For example, from TransposedConv2D:

filter: A 4-D tensor of shape
     `[width, height, input channel count, output channel count]`

But in func conv2DBackpropInput:

filter: 4-D with shape
    `[filter_height, filter_width, in_channels, out_channels]`

Shashi456 · 2019-07-01T03:59:07Z

Yeah @sjaz24 I noticed that we might have documented it wrong

Shashi456 · 2019-07-28T07:16:05Z

@sjaz24 Are you sure this test passes locally for you?
Because I get this error:

Fatal error: Conv2DCustomBackpropInput: Size of out_backprop doesn't match computed: actual = 4, computed = 0 spatial_dim: 1 input: 3 filter: 4 output: 4 stride: 1 dilation: 1: file /swift-base/tensorflow-swift-apis/Sources/TensorFlow/Bindings/EagerExecution.swift, line 299
Current stack trace:
0    libswiftCore.so                    0x00007fec9e1b88d0 swift_reportError + 50
1    libswiftCore.so                    0x00007fec9e227ac0 _swift_stdlib_reportFatalErrorInFile + 115
2    libswiftCore.so                    0x00007fec9e14faee <unavailable> + 3738350
3    libswiftCore.so                    0x00007fec9e14fc67 <unavailable> + 3738727
4    libswiftCore.so                    0x00007fec9df1dc4d <unavailable> + 1436749
5    libswiftCore.so                    0x00007fec9e124a98 <unavailable> + 3562136
6    libswiftCore.so                    0x00007fec9df1d0a9 <unavailable> + 1433769
7    libswiftTensorFlow.so              0x00007fec9b45dc80 <unavailable> + 2669696
8    libswiftTensorFlow.so              0x00007fec9b2c2e00 checkOk(_:file:line:) + 461
9    libswiftTensorFlow.so              0x00007fec9b2c9f30 TFE_Op.evaluateUnsafe() + 506
10   libswiftTensorFlow.so              0x00007fec9b2ca7a0 TFE_Op.execute<A>(_:) + 132
11   libswiftTensorFlow.so              0x00007fec9b2d3434 <unavailable> + 1053748
16   libswiftTensorFlow.so              0x00007fec9b4a872b <unavailable> + 2975531
17   libswiftTensorFlow.so              0x00007fec9b59550c <unavailable> + 3945740
18   libswiftTensorFlow.so              0x00007fec9b42eba0 withContext<A>(_:_:) + 143
19   libswiftTensorFlow.so              0x00007fec9b42ed20 withLearningPhase<A>(_:_:) + 234
20   libswiftTensorFlow.so              0x00007fec9b4a85a0 Layer.inferring(from:) + 232
22   repl_swift                         0x0000000000400490 <unavailable> + 1168

sjaz24 · 2019-07-30T04:24:00Z

Yes, it works. I just basically ran the same code in Colab.

sjaz24 · 2019-07-30T04:26:48Z

You most likely haven't undone your changes. You need to put back the original code that you changed.

(1 * paddingIndex))
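The difference between the two Swift expressions discussed in this thread can be sketched numerically. This is an illustration only: it assumes paddingIndex is 0 for .same and 1 for .valid (that definition is not quoted in this thread), and it uses only the default branch of the Keras rule quoted earlier (no output padding, dilation 1). The function names are hypothetical.

```python
def keras_length(dim_size, stride, kernel, padding):
    """Keras rule quoted earlier, for output_padding=None and dilation=1."""
    if padding == "valid":
        return dim_size * stride + max(kernel - stride, 0)
    if padding == "same":
        return dim_size * stride
    raise ValueError(f"unsupported padding: {padding}")


def original_length(dim_size, stride, kernel, padding_index):
    """Original Swift expression: subtracts 1 only when paddingIndex is 1."""
    return (dim_size - 1 * padding_index) * stride + kernel * padding_index


def modified_length(dim_size, stride, kernel, padding_index):
    """Modified Swift expression from this PR: always subtracts 1."""
    return (dim_size - 1) * stride + kernel * padding_index


# Assumption: .same maps to 0 and .valid maps to 1.
PADDING_INDEX = {"same": 0, "valid": 1}

# Height dimension of the test: input 4, stride 1, filter 4.
for padding in ("same", "valid"):
    index = PADDING_INDEX[padding]
    print(padding,
          keras_length(4, 1, 4, padding),
          original_length(4, 1, 4, index),
          modified_length(4, 1, 4, index))
# same 4 4 3
# valid 7 7 7
```

With .same padding the modified expression yields 3 instead of 4, which lines up with the "input: 3 ... output: 4" values in the Conv2DCustomBackpropInput error above. For .valid padding all three agree here because the kernel is at least as large as the stride.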

Sources/TensorFlow/Layers/Convolutional.swift

t-ae · 2019-08-02T08:23:17Z

@sjaz24 pointed out the filter shape order in the documentation.
It looks like there's another mistake, in the order of input channel count and output channel count.

Basically, conv2DBackpropInput exists for backpropagating through Conv2D.
So what conv2DBackpropInput calls in_channels is Conv2D's input channel count, not TransposedConv2D's.
TransposedConv2D's filter shape is the transposition of Conv2D's filter shape, so input channel count and output channel count must be swapped.

This is what Marc Rasi says here:
https://groups.google.com/a/tensorflow.org/forum/m/#!msg/swift/UUPwV01sZrE/LszG6T7dBQAJ

In summary, the documentation of filter should be:

filter: A 4-D tensor of shape
     `[height, width, output channel count, input channel count]`
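To make the channel ordering concrete, here is a small sketch of how the output shape follows from that filter layout. It is a hypothetical helper rather than code from swift-apis; it assumes NHWC inputs, no output padding, dilation 1, and the Keras length rule quoted at the top of this thread.

```python
def transposed_conv2d_output_shape(input_shape, filter_shape, strides, padding):
    """Output shape of an NHWC transposed convolution (hypothetical helper).

    filter_shape is [height, width, output channels, input channels].
    """
    batch, in_h, in_w, in_c = input_shape
    k_h, k_w, out_c, filter_in_c = filter_shape
    if in_c != filter_in_c:
        raise ValueError(
            f"input has {in_c} channels but filter expects {filter_in_c}")

    def length(dim, stride, kernel):
        # Default branch of the Keras rule (no output padding, dilation 1).
        if padding == "same":
            return dim * stride
        if padding == "valid":
            return dim * stride + max(kernel - stride, 0)
        raise ValueError(f"unsupported padding: {padding}")

    return [batch,
            length(in_h, strides[0], k_h),
            length(in_w, strides[1], k_w),
            out_c]


# The test case from this PR: one channel in, one channel out.
print(transposed_conv2d_output_shape([1, 4, 2, 1], [4, 2, 1, 1], (1, 1), "same"))
# [1, 4, 2, 1]

# Three input channels mapped to five output channels.
print(transposed_conv2d_output_shape([1, 4, 2, 3], [4, 2, 5, 3], (2, 2), "valid"))
# [1, 10, 4, 5]
```

With a single channel on both sides, as in the test, the two channel orderings are indistinguishable, so a test with differing input and output channel counts would be needed to catch a swapped filter.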

Shashi456 · 2019-08-19T08:41:43Z

Fixed this. Ready to be reviewed. Tests and build pass locally.

Tests/TensorFlowTests/LayerTests.swift

saeta · 2019-11-07T18:48:34Z

Hi @Shashi456! Are you able to fix the merge conflicts here? Thanks! -Brennan

Shashi456 · 2019-11-08T18:05:34Z

@saeta done.

Shashi456 and others added 9 commits June 24, 2019 17:51

Fixing Transposed conv2d errors

8cf9fda

adding transposed conv2d test

cd00fbf

updating according to parameter doc

8f44e69

Updating tconv2d test

1150581

Updating tconv2d test

86af57f

test changes

6bfdeb3

Some changes in the test

72d7c8f

Merging master

73206c2

Merge errors

3527a4a

Minor error

3243306

Shashi456 added 2 commits June 25, 2019 11:33

Another transposed conv2d error

0fac0ec

Merge branch 'tconv2d' of https://github.com/Shashi456/swift-apis int…

fafdf1e

…o tconv2d

rxwei suggested changes Jun 25, 2019

Sources/TensorFlow/Layers/Convolutional.swift (outdated, resolved)

Making tensors generic over scalar type

a96c6f4

rxwei reviewed Jun 25, 2019

Sources/TensorFlow/Layers/Convolutional.swift (outdated, resolved)

Sources/TensorFlow/Layers/Convolutional.swift (outdated, resolved)

Review changes

6ee8541

This was referenced Jul 3, 2019

_vjpConv2DBackpropInput using shape instead of using filter size for … #331

Merged

Another fix for vjp conv2 d backprop input #333

Merged

Merge branch 'master' into tconv2d

8995a92

t-ae reviewed Aug 2, 2019

Sources/TensorFlow/Layers/Convolutional.swift (outdated, resolved)

Shashi456 and others added 3 commits August 19, 2019 13:42

Merge branch 'master' into tconv2d

19c26f7

Fixing argument order

757450b

Updating acc to review

1410c44

Shashi456 mentioned this pull request Aug 21, 2019

Add Separable Conv1D layer #458

Merged

Shashi456 mentioned this pull request Aug 30, 2019

Add Conv3D gradient test and vjp fixes #460

Merged

Merge branch 'master' into tconv2d

444c023

rxwei suggested changes Aug 31, 2019

Tests/TensorFlowTests/LayerTests.swift (outdated, resolved)

Shashi456 changed the title ~~Fixing transposed conv2d error~~ Fix Transposed Conv2d error & add test Aug 31, 2019

saeta requested a review from marcrasi November 7, 2019 18:48

saeta assigned marcrasi Nov 7, 2019

Merge branch 'master' into tconv2d

139f914

marcrasi added the kokoro:run label Nov 8, 2019

kokoro-team removed the kokoro:run label Nov 8, 2019

marcrasi merged commit 35dfddf into tensorflow:master Nov 8, 2019
