@Jolyne_AI: GitHub Open-Source CUDA System Tutorial: LeetCUDA (From Beginner to Advanced, All in One) 200+ Progressive CUDA Kernel Practice Problems, with a companion HGEMM library achieving 98%–100% of cuBLAS performance. Plus 100+ articles on high-performance computing...

X AI KOLs Timeline 06/28/26, 03:00 PM Tools

cuda gpu high-performance-computing open-source pytorch kernel tutorial

Summary

LeetCUDA is an open-source CUDA system tutorial on GitHub, featuring over 200 progressive CUDA Kernel practice problems and 100+ high-performance computing blog posts. Its companion HGEMM library achieves 98%–100% of cuBLAS performance, making it ideal for CUDA beginners and AI engineers to systematically master CUDA optimization.

GitHub Open-Source CUDA System Tutorial: LeetCUDA (From Beginner to Advanced, All in One) 200+ progressive CUDA Kernel practice problems, with a companion HGEMM library achieving 98%–100% of cuBLAS performance. Plus 100+ high-performance computing blog posts focusing on key techniques and optimization methods, helping you advance from "being able to write" to "writing fast and stable code." GitHub: http://github.com/xlite-dev/LeetCUDA… Carefully designed for beginners, combined with PyTorch to outline a clear path: write correctly → write fast → approach library-level performance. Suitable for developers aiming to master CUDA systematically, and also as a reference and advancement path for AI engineers working on large model inference optimization.

Original Article

View Cached Full Text

Cached at: 06/29/26, 06:23 AM

📚 LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners 🐑

🔥🔥 PR Welcome: Add Your Kernel to LeetCUDA! Let’s make it Awesome together! 🎉🎉

@Jolyne_AI: GitHub Open-Source CUDA System Tutorial: LeetCUDA (From Beginner to Advanced, All in One) 200+ Progressive CUDA Kernel Practice Problems, with a companion HGEMM library achieving 98%–100% of cuBLAS performance. Plus 100+ articles on high-performance computing...

Similar Articles

https://www.youtube.com/watch?v=qRLyoP8zOyQ

@neural_avb: TIL about "GPU Mode" They got a youtube series to learn CUDA. Plus a github repo with slides/notebooks. Some lectures a…

@0x0SojalSec: Fuck your paid courses, Master GPU engineering for AI systems. From foundational books and CUDA/ROCm programming to low…

Every AI researcher should grasp inference acceleration—CUDA Graph is the heart of vLLM's GPU efficiency

CUDA Books

Submit Feedback

Similar Articles

https://www.youtube.com/watch?v=qRLyoP8zOyQ

@neural_avb: TIL about "GPU Mode" They got a youtube series to learn CUDA. Plus a github repo with slides/notebooks. Some lectures a…

@0x0SojalSec: Fuck your paid courses, Master GPU engineering for AI systems. From foundational books and CUDA/ROCm programming to low…

Every AI researcher should grasp inference acceleration—CUDA Graph is the heart of vLLM's GPU efficiency