native_layer_norm (for width dim) #3001

copyrightly · 2024-04-11T22:27:01Z

Summary:
We implement `native_layer_norm`, which has three outputs:

- the input tensor normalized according to the given `normalized_shape`
- mean
- 1/sqrt(var + eps)
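
For reference, the three outputs can be written out as below for one row of length N along the normalized (width) dim. This is a sketch of the standard layer norm definition, not taken from the PR itself: the symbols are chosen here for illustration, and the optional affine weight γ and bias β are assumed from the operator's schema rather than mentioned in the summary.

```latex
\mu = \frac{1}{N}\sum_{i=1}^{N} x_i,
\qquad
\sigma^2 = \frac{1}{N}\sum_{i=1}^{N} (x_i - \mu)^2,
\qquad
\mathrm{rstd} = \frac{1}{\sqrt{\sigma^2 + \epsilon}},
\qquad
y_i = (x_i - \mu)\,\mathrm{rstd}\,\gamma_i + \beta_i
```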

https://www.internalfb.com/code/fbsource/[8db4b5872791bb88a62ecaa60b667ee4c1b189bf]/fbcode/caffe2/aten/src/ATen/native/native_functions.yaml?lines=3252

Following SS-JIA's suggestion that a model-specific implementation is more performant than, and preferred to, a generic one, we implemented the op in the following optimized way:

- Our current use case has a `normalized_shape` of length 1, so we normalize by computing the mean and variance over the last (width) dim.
- We do the whole computation in a single shader, `native_layer_norm.glsl`, rather than invoking separate shaders to compute the mean and variance.
- We use [Welford's online algorithm](https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Welford's_online_algorithm) to compute the mean and variance in one pass.
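
The approach described above can be sketched on the CPU in plain Python. This is only an illustrative reference, not the contents of `native_layer_norm.glsl`: the function names `welford_mean_var` and `layer_norm_width` are hypothetical, the optional `weight` and `bias` arguments are assumed from the operator's schema, the default `eps` is an arbitrary choice, and each row is assumed to be non-empty.

```python
import math


def welford_mean_var(row):
    """Single-pass mean and population (biased) variance via Welford's update."""
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for count, x in enumerate(row, start=1):
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)  # uses the already-updated mean
    return mean, m2 / len(row)


def layer_norm_width(rows, weight=None, bias=None, eps=1e-5):
    """Normalize each row over its last dim; returns (out, means, rstds)."""
    out, means, rstds = [], [], []
    for row in rows:
        mean, var = welford_mean_var(row)
        rstd = 1.0 / math.sqrt(var + eps)
        normed = [(x - mean) * rstd for x in row]
        if weight is not None:  # optional elementwise scale (assumed)
            normed = [v * w for v, w in zip(normed, weight)]
        if bias is not None:  # optional elementwise shift (assumed)
            normed = [v + b for v, b in zip(normed, bias)]
        out.append(normed)
        means.append(mean)
        rstds.append(rstd)
    return out, means, rstds
```

Calling `layer_norm_width([[1.0, 2.0, 3.0, 4.0]])` returns the normalized row together with one mean and one 1/sqrt(var + eps) value per row, mirroring the three outputs listed in the summary.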

Differential Revision: D56005629

pytorch-bot · 2024-04-11T22:27:04Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3001

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit cec4574 with merge base b1edc3d:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-04-11T22:27:12Z

This pull request was exported from Phabricator. Differential Revision: D56005629

Summary:
We implement `native_layer_norm`, which has three outputs:

- the input tensor normalized according to the given `normalized_shape`
- mean
- 1/sqrt(var + eps)

https://www.internalfb.com/code/fbsource/[8db4b5872791bb88a62ecaa60b667ee4c1b189bf]/fbcode/caffe2/aten/src/ATen/native/native_functions.yaml?lines=3252

Following SS-JIA's suggestion that a model-specific implementation is more performant than, and preferred to, a generic one, we implemented the op in the following optimized way:

- Our current use case has a `normalized_shape` of length 1, so we normalize by computing the mean and variance over the last (width) dim.
- We do the whole computation in a single shader, `native_layer_norm.glsl`, rather than invoking separate shaders to compute the mean and variance.
- We use [Welford's online algorithm](https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#Welford's_online_algorithm) to compute the mean and variance in one pass.

Differential Revision: D56005629

facebook-github-bot · 2024-04-12T17:57:39Z

This pull request was exported from Phabricator. Differential Revision: D56005629

facebook-github-bot · 2024-04-15T21:16:22Z

This pull request has been merged in 74576e8.

facebook-github-bot added the CLA Signed label Apr 11, 2024

facebook-github-bot added the fb-exported label Apr 11, 2024

copyrightly force-pushed the export-D56005629 branch from d0d86da to cec4574 April 12, 2024 17:57

jorgep31415 approved these changes Apr 15, 2024

facebook-github-bot closed this in 74576e8 Apr 15, 2024

facebook-github-bot added the Merged label Apr 15, 2024

mergennachin mentioned this pull request Apr 26, 2024

disclaimer #3376

Closed
