Skip to content

Releases: xlite-dev/LeetCUDA

HGEMM Up to 115 TFLOPS:L20

21 Oct 12:55
a2934b9
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.13...v2.4.15

HGEMM Up to 113 TFLOPS:L20

21 Oct 01:56
0aeb450
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.12...v2.4.13

v2.4.12 SGEMM TF32 Swizzle

17 Oct 02:24
8c6922b
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.11...v2.4.12

v2.4.11 HGEMM Block Swizzle

16 Oct 03:04
bc3d78e
Compare
Choose a tag to compare

v2.4.10 SGEMM TF32 Stage 2/3

15 Oct 02:04
2906e78
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.9...v2.4.10

v2.4.9 HGEMM WMMA Stage

13 Oct 09:15
3acd5e2
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.8...v2.4.9

v2.4.8 HGEMM WMMA Part-1

11 Oct 11:05
5aef1b1
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.7...v2.4.8

v2.4.7 SGEMM Copy Async

10 Oct 06:16
3b56750
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.6...v2.4.7

v2.4.6 HGEMM Copy Async

08 Oct 03:48
bbec7b5
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.5...v2.4.6

v2.4.5 HGEMM Double Buffers

30 Sep 07:47
3f5ace3
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.4.4...v2.4.5