Skip to content

Releases: xlite-dev/LeetCUDA

v2.4.17

29 Oct 06:39
a65f1f6
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.16...v2.4.17

HGEMM Warp Swizzle/Reg Buffers

25 Oct 05:59
6c89595
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.15...v2.4.16

HGEMM Up to 115 TFLOPS:L20

21 Oct 12:55
a2934b9
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.13...v2.4.15

HGEMM Up to 113 TFLOPS:L20

21 Oct 01:56
0aeb450
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.12...v2.4.13

v2.4.12 SGEMM TF32 Swizzle

17 Oct 02:24
8c6922b
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.11...v2.4.12

v2.4.11 HGEMM Block Swizzle

16 Oct 03:04
bc3d78e
Compare
Choose a tag to compare

v2.4.10 SGEMM TF32 Stage 2/3

15 Oct 02:04
2906e78
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.9...v2.4.10

v2.4.9 HGEMM WMMA Stage

13 Oct 09:15
3acd5e2
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.8...v2.4.9

v2.4.8 HGEMM WMMA Part-1

11 Oct 11:05
5aef1b1
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.7...v2.4.8

v2.4.7 SGEMM Copy Async

10 Oct 06:16
3b56750
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.6...v2.4.7