You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[ET-VK] De vectorise conv2d pw shader to improve perf.
Pull Request resolved: #11108
This diff optimizes the performance of the `conv2d_pw` shader by de-vectorizing its implementation.
* The original vectorized implementation of the `conv2d_pw` shader has been replaced with a de-vectorized approach to improve performance.
* The `sum` array has been redefined to hold `float` values instead of `vec4` to accommodate the de-vectorized computation.
These changes seem to allow shader compiler to better optimize operations within the shader hence improving perf.
ghstack-source-id: 286652100
@exported-using-ghexport
Differential Revision: [D75307267](https://our.internmc.facebook.com/intern/diff/D75307267/)
0 commit comments