v2.4.3 Pack Softmax
What's Changed
- [LayerNorm][FP16] support fp16x8_pack_f32 kernel by @DefTruth in https://github.com/DefTruth/CUDA-Learn-Notes/pull/48
- [Softmax][FP16] Pack f16x8 softmax kernel by @DefTruth in https://github.com/DefTruth/CUDA-Learn-Notes/pull/49
Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.2...v2.4.3