v3.0.9
What's Changed
- feat: add some torch.distributed examples by @DefTruth in #313
- feat: add some torch.distributed examples by @DefTruth in #315
- feat: add a naive CuTe flash-attn by @botbw in #314
- fix(kernels): correct typos in LayerNorm kernel at lines 73, 110, 346, and 443 by @nxdxml in #317
- misc: manually update submodules by @DefTruth in #318
- chore: add naive cute flash-attn index by @DefTruth in #319
- add triton merge_attn_states zhihu blog by @DefTruth in #320
New Contributors
Full Changelog: v3.0.8...v3.0.9