v2.4.7 SGEMM Copy Async
What's Changed
- [SGEMM][Async] Add naive copy async SGEMM by @DefTruth in https://github.com/DefTruth/CUDA-Learn-Notes/pull/64
- [SGEMM][Async] Add K16 + Copy Async Kernel by @DefTruth in https://github.com/DefTruth/CUDA-Learn-Notes/pull/65
Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.6...v2.4.7