You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[ExecuTorch] Allow using custom SDPA for non-float32 dtypes in llama demo
Converting the input to and from float32 is faster than not using the op. h/t to torchchat, which does this already (though it had a bug, which I sent a patch for).
Differential Revision: [D63158951](https://our.internmc.facebook.com/intern/diff/D63158951/)
ghstack-source-id: 244181863
Pull Request resolved: #5548
0 commit comments