[ET-VK][Ops] aten.convolution (SlidingWindow) #2812

jorgep31415 · 2024-04-02T19:06:32Z

Stack from ghstack (oldest at bottom):

The Operator

nn.Module invocations of nn.Conv2d and nn.ConvTranspose2d get compiled to aten.convolution.default in the Edge Dialect, which carries the signature

- func: convolution(Tensor input, Tensor weight, Tensor? bias, int[] stride, SymInt[] padding, int[] dilation, bool transposed, SymInt[] output_padding, int groups) -> Tensor

Summary (cases handled)

We introduce support for the convolution cases covered by ATen-VK's default SlidingWindow implementation. This is achieved by

reusing the existing conv2d.glsl, and
moving special weights prepacking from CPU to the GPU in conv2d_prepack_weights.glsl.

We also include resizing support for dynamic shapes. Note that only height and width of the input can vary.

Cases not handled

The implementation is on-par with ATen-VK's SlidingWindow. This means the following cases are missing:

Groups G > 1. Largely not covered by ATen-VK. G = in_channels is covered by ATen-VK's Depthwise impl and will be added soon.
Batch (input) N > 1. Not covered by ATen-VK.
Padding > 0 while Dilation, Kernel > 1. Not covered by ATen-VK.

Coming soon

Transpose convolution
Depthwise convolution (for completeness)
Pointwise convolution (for optimization)
Null bias

Differential Revision: D55346778

## The Operator `nn.Module` invocations of [`nn.Conv2d`](https://pytorch.org/docs/stable/generated/torch.nn.Conv2d.html#torch.nn.Conv2d) and [`nn.ConvTranspose2d`](https://pytorch.org/docs/stable/generated/torch.nn.ConvTranspose2d.html#torch.nn.ConvTranspose2d) get compiled to `aten.convolution.default` in the Edge Dialect, which carries the signature ``` - func: convolution(Tensor input, Tensor weight, Tensor? bias, int[] stride, SymInt[] padding, int[] dilation, bool transposed, SymInt[] output_padding, int groups) -> Tensor ``` ## Summary (cases handled) We introduce support for the convolution cases covered by [ATen-VK's default SlidingWindow implementation](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L73). This is achieved by - reusing the [existing `conv2d.glsl`](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/glsl/conv2d.glsl), and - [moving special weights prepacking from CPU](https://github.com/pytorch/pytorch/blob/09c72eaa3f69f90402c86a30abf4fc621298578c/aten/src/ATen/native/vulkan/ops/Convolution.cpp#L134-L235) to the GPU in `conv2d_prepack_weights.glsl`. We also include resizing support for dynamic shapes. Note that only height and width of the input can vary. ## Cases not handled The implementation is on-par with ATen-VK's SlidingWindow. This means the following cases are missing: 1. **Groups G > 1.** Largely not covered by ATen-VK. `G = in_channels` is covered by ATen-VK's Depthwise impl and will be added soon. 2. **Batch (input) N > 1.** Not covered by ATen-VK. 3. **Padding > 0 while Dilation, Kernel > 1.** Not covered by ATen-VK. ## Coming soon For our CUNET model, the first two are required and the third is useful. 1. Transpose convolution 2. Depthwise convolution (for completeness) 3. Pointwise convolution (for optimization) 4. Null bias Differential Revision: [D55346778](https://our.internmc.facebook.com/intern/diff/D55346778/) [ghstack-poisoned]

pytorch-bot · 2024-04-02T19:06:35Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/2812

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 8fee216 with merge base d3326a2 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-04-02T19:06:48Z