Skip to content

Releases: xlite-dev/LeetCUDA

v3.0.9

12 May 01:53
27b76d0
Compare
Choose a tag to compare

What's Changed

  • feat: add some torch.distributed examples by @DefTruth in #313
  • feat: add some torch.distributed examples by @DefTruth in #315
  • feat: add a naive CuTe flash-attn by @botbw in #314
  • fix(kernels): correct typo in LayerNorm kernel at line 73 110 346 443 by @nxdxml in #317
  • misc: manually update submodules by @DefTruth in #318
  • chore: add naive cute flash-attn index by @DefTruth in #319
  • add triton merge_attn_states zhihu blog by @DefTruth in #320

New Contributors

Full Changelog: v3.0.8...v3.0.9

v3.0.8

06 May 06:23
a566b88
Compare
Choose a tag to compare

What's Changed

Full Changelog: v3.0.7...v3.0.8

LeetCUDA v3.0.7

28 Apr 06:02
7673ac6
Compare
Choose a tag to compare

What's Changed

Full Changelog: v3.0.6...v3.0.7

LeetCUDA v3.0.6

26 Apr 06:51
acaac78
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.0.5...v3.0.6

v3.0.5

09 Apr 15:15
ba6fac2
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v3.0.4...v3.0.5

v3.0.4

15 Mar 03:14
ca63606
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v3.0.3...v3.0.4

v3.0.3

04 Mar 04:14
077096a
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v3.0.2...v3.0.3

v3.0.2

24 Feb 01:30
a9e2d17
Compare
Choose a tag to compare

v3.0.1

06 Feb 12:08
ee9f706
Compare
Choose a tag to compare

v3.0.0

22 Jan 10:08
7f35ae1
Compare
Choose a tag to compare

What's Changed

Full Changelog: DefTruth/CUDA-Learn-Notes@v2.6.15...v3.0.0