v2.4.8 HGEMM WMMA Part-1
What's Changed
- [GELU] Add f32/x4, f16/x2/x8/x8pack kernel. by @bear-zd in https://github.com/DefTruth/CUDA-Learn-Notes/pull/66
- [HGEMM] HGEMM Tensor Cores Support Part-1 by @DefTruth in https://github.com/DefTruth/CUDA-Learn-Notes/pull/67
Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.7...v2.4.8