Releases: xlite-dev/LeetCUDA

v2.4.6 HGEMM Copy Async

08 Oct 03:48
bbec7b5

What's Changed

New Contributors

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.5...v2.4.6

v2.4.5 HGEMM Double Buffers

30 Sep 07:47
3f5ace3

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.4...v2.4.5

v2.4.4 Pack HGEMM

29 Sep 11:01
7cf1879

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.3...v2.4.4

v2.4.3 Pack Softmax

27 Sep 02:00
5901796

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.2...v2.4.3

v2.4.2 Pack RMSNorm

26 Sep 01:14
54c761d

v2.4.1 Pack LayerNorm

25 Sep 06:07
4667308

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4...v2.4.1

v2.4 Pack Reduce LDST

24 Sep 02:13
bf283f2

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.3.1...v2.4

v2.3.1 f16x8 Pack Elementwise

23 Sep 03:44
d43c53d

What's Changed

New Contributors

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.3...v2.3.1

v2.3 Refactor 6/N

17 Sep 07:57
f9001b9

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.2...v2.3

v2.2 Refactor 5/N

12 Sep 01:36
86ab98e