In this repository, I'm going to learn CUDA programming by following the lectures and notes from the repository provided by the CUDA-mode team.
I will be using the CUDA toolkit 12.1 on windows platform for this course, and I will use a NVIDIA GeForce RTX 4080 for coding.
🔗 Lecture 1: Profiling and Integrating CUDA kernels in PyTorch