theRTLmaker/README.md

Low-level systems engineer working at the boundary of hardware, GPUs and ML performance

👋 About me

I work close to the metal.

My background is in CPU architecture, verification and memory systems, but my day-to-day curiosity lives in low-level programming: GPU kernels, performance modeling, and how real workloads stress real hardware.

I care about understanding systems end-to-end, from cache lines and warps up to training loops and frameworks, and using that understanding to make things faster and more predictable.

🔭 Current focus

100 Days of CUDA

A hands-on deep dive into GPU programming, kernel design and performance behavior.

Repo: https://github.com/theRTLmaker/CUDA_in_100_days


🌱 Actively building depth in

  • CUDA and GPU kernel optimization
  • GPU memory hierarchies and profiling
  • Performance modeling and benchmarking
  • ML training workloads and system bottlenecks

โš™๏ธ Technical toolkit

  • Low-level C++ and performance-oriented Python
  • CUDA programming and GPU profiling tools
  • CPU microarchitecture, caches and coherency
  • SystemVerilog and hardware-software interfaces

🧠 Interests

  • GPU and accelerator programming
  • ML systems and performance engineering
  • Hardware-aware software design
  • Debugging at uncomfortable layers
  • Making abstractions earn their keep

📬 Connect

Always up for conversations about GPUs, low-level systems, performance engineering or how software really hits the hardware.

Pinned

  1. GPU-Characterization (Public)

    AMD GPU Methodology for Non-Conventional DVFS Characterization

    C++ 2

  2. My-Triton-Puzzles-Solutions (Public)

    A man learning his Triton coding

    Jupyter Notebook

  3. CUDA_in_100_days (Public)

    A deep dive into writing one CUDA kernel per day.

    Cuda