Enable restricted split + cat in order to enable SP #253
Conversation
```python
return list(out)  # Errors: can't `cat_cuda` with float8_e4m3fn
```
Oh, so this means that `torch.cat` can't be applied to dtype `e4m3fn`?

Normally I'd expect this is something we could just make our CUDA kernel support (concatenating tensors of the same dtype), but I'm not sure whether there are further complications for the fp8 dtype.

But if all we want is to concatenate the fp8 tensors together, a simpler approach: do `fp8_inner_tensor.view(torch.uint8)`, perform the `torch.cat`, then after the cat operation do `fp8_catted_tensor.view(torch.float8_e4m3fn)`. I wonder if this would unblock?
sgtm! thanks for supporting this!
@drisspg has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Summary
This comes from needing to support sequence parallelism in torchtitan.