[ET-VK][BE] vTensor cleanup 6/N - Do not use gpu_memory_layout as a source of truth, use packed_dim_whcn_idx directly #5479


Closed
wants to merge 3 commits

Conversation

@SS-JIA (Contributor) commented Sep 18, 2024

Stack from ghstack (oldest at bottom):

Context

`GPUMemoryLayout` is not a sufficient description of how a tensor is laid out in GPU memory. For buffer-backed tensors, this has been true ever since strides were introduced; for texture-backed tensors, it has been true since the introduction of `axis_map`.

For buffer-backed tensors, the `strides` of the tensor are required to fully represent how the tensor's data is laid out in GPU memory.
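
As a rough illustration (not the actual ExecuTorch implementation; `make_strides` and the dim-order convention are hypothetical), strides can be derived from the sizes plus an ordering of dims from outermost to innermost, and it is that ordering, not a single enum value, that fully pins down the buffer layout:

```cpp
#include <cstdint>
#include <vector>

// Hypothetical sketch: derive strides for a buffer-backed tensor from its
// sizes and a dim order (outermost to innermost). The dim order carries
// strictly more information than a GPUMemoryLayout enum value.
std::vector<int64_t> make_strides(
    const std::vector<int64_t>& sizes,
    const std::vector<int64_t>& dim_order) {
  std::vector<int64_t> strides(sizes.size());
  int64_t running = 1;
  // Walk from the innermost (fastest moving) dim outwards, accumulating
  // the number of elements a unit step along each dim skips over.
  for (auto it = dim_order.rbegin(); it != dim_order.rend(); ++it) {
    strides[*it] = running;
    running *= sizes[*it];
  }
  return strides;
}
```

For sizes `{2, 3, 4}`, dim order `{0, 1, 2}` yields strides `{12, 4, 1}`, while dim order `{0, 2, 1}` yields `{12, 1, 3}`; only the strides distinguish the two layouts.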

For texture-backed tensors, both the `axis_map` and the `packed_dim_whcn_idx` are required to fully represent the layout of the tensor as an image texture.
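
For intuition, here is a minimal sketch of how the two pieces interact when addressing a texture; the names and the WHC coordinate convention are illustrative, not the library's API. `axis_map` decides which tensor dim each texture axis walks over, and `packed_dim_whcn_idx` picks the dim that is folded four-to-a-texel:

```cpp
#include <array>
#include <cstdint>

// Hypothetical sketch: map a tensor coordinate (given in WHC order) to a
// texture position plus a component index inside the 4-wide texel.
struct TexelPos {
  int32_t x = 0, y = 0, z = 0; // texture position
  int32_t component = 0;       // index within the texel (0..3)
};

TexelPos to_texel_pos(
    const std::array<int32_t, 3>& whc_coord, // tensor coord, WHC order
    const std::array<int32_t, 3>& axis_map,  // texture axis -> tensor dim
    int32_t packed_dim_whcn_idx) {
  int32_t pos[3];
  TexelPos out;
  for (int32_t axis = 0; axis < 3; ++axis) {
    const int32_t dim = axis_map[axis]; // tensor dim this texture axis indexes
    int32_t v = whc_coord[dim];
    if (dim == packed_dim_whcn_idx) {
      out.component = v % 4; // position inside the 4-wide texel
      v = v / 4;             // texels along the packed dim are folded by 4
    }
    pos[axis] = v;
  }
  out.x = pos[0];
  out.y = pos[1];
  out.z = pos[2];
  return out;
}
```

The point is that neither piece alone determines the texture layout: permuting `axis_map` or changing the packed dim each produces a different image.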

Furthermore, functions like `virtual_transpose()` can produce tensor layouts that cannot be cleanly captured by an enum.
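
A hypothetical sketch of why: a virtual transpose only swaps layout metadata, and after a swap the combination of sizes, strides, and packed dim may match no predefined enum value (for simplicity, dims here are assumed to be indexed in the same WHCN order as the packed dim index):

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical sketch of a virtual transpose: exchange two dims in the
// layout metadata without touching the underlying data. The resulting
// (sizes, strides, packed dim) triple need not correspond to any
// GPUMemoryLayout enum value.
void virtual_transpose_sketch(
    std::vector<int64_t>& sizes,
    std::vector<int64_t>& strides,
    int32_t& packed_dim_whcn_idx,
    int32_t dim0,
    int32_t dim1) {
  std::swap(sizes[dim0], sizes[dim1]);
  std::swap(strides[dim0], strides[dim1]);
  // The packed dim travels with the transpose.
  if (packed_dim_whcn_idx == dim0) {
    packed_dim_whcn_idx = dim1;
  } else if (packed_dim_whcn_idx == dim1) {
    packed_dim_whcn_idx = dim0;
  }
}
```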

This diff decouples `GPUMemoryLayout` from `vTensor`. Rather than being stored as a tensor property, it is used only during construction to determine the initial tensor layout metadata.

The layout of a tensor can be estimated afterwards using `estimate_memory_layout()`, but this is only a "best effort" at producing a comparable memory layout.
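
A minimal sketch of what such a best-effort estimate might look like; the enum values and the mapping below are illustrative, not the actual ExecuTorch definitions:

```cpp
#include <cstdint>

// Illustrative enum mirroring the three packed-dim choices in WHCN order.
enum class GPUMemoryLayout : int32_t {
  TENSOR_WIDTH_PACKED = 0,    // packed_dim_whcn_idx == 0
  TENSOR_HEIGHT_PACKED = 1,   // packed_dim_whcn_idx == 1
  TENSOR_CHANNELS_PACKED = 2, // packed_dim_whcn_idx == 2
};

// Best effort: recover a comparable enum value from the packed dim alone.
// Any extra layout state (a permuted axis_map, non-standard strides) is
// simply not representable by the enum and is dropped.
GPUMemoryLayout estimate_memory_layout_sketch(int32_t packed_dim_whcn_idx) {
  return static_cast<GPUMemoryLayout>(packed_dim_whcn_idx);
}
```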

`GPUMemoryLayout` was helpful as a compact representation of the tensor's `packed_dim_whcn_idx`, which identifies the "fastest moving" dimension for buffer-backed tensors, or the dim packed along a texel for texture-backed tensors. Whenever `GPUMemoryLayout` is referenced, then, what is really of interest is the packed dim index, so this diff also replaces calls to `memory_layout()` with calls to `packed_dim_whcn_idx()`.
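
To make the replacement concrete, a hypothetical before/after call site; the tensor type and constant below are stand-ins, while the two accessor names come from this diff:

```cpp
#include <cstdint>

// Illustrative WHCN index of the channels dim (W = 0, H = 1, C = 2, N = 3).
constexpr int32_t kChannelsDimIdx = 2;

// Stand-in tensor handle exposing only what this example needs.
struct VTensorLike {
  int32_t packed_dim_whcn_idx() const { return packed_dim_; }
  int32_t packed_dim_ = kChannelsDimIdx;
};

// Before this diff, a check would compare memory_layout() against an enum
// value; after it, the call site asks for the packed dim index directly.
bool is_channels_packed(const VTensorLike& t) {
  return t.packed_dim_whcn_idx() == kChannelsDimIdx;
}
```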

Differential Revision: [D62995121](https://our.internmc.facebook.com/intern/diff/D62995121/)


@pytorch-bot (bot) commented Sep 18, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5479

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit 6f363c6 with merge base 8ef6c79:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label Sep 18, 2024
SS-JIA added a commit that referenced this pull request Sep 18, 2024
… source of truth, use `packed_dim_whcn_idx` directly

ghstack-source-id: 243424497
Pull Request resolved: #5479
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D62995121


@facebook-github-bot

This pull request has been merged in 7c6d58a.

@SS-JIA SS-JIA deleted the gh/SS-JIA/84/head branch January 24, 2025 19:43
Labels: CLA Signed, fb-exported, Merged