Feb. 25, 2024, 12:31 p.m. | /u/clueless_scientist

Machine Learning www.reddit.com

Hey, I am writing Triton kernels and the only way to debug the code as far as I know, is using tl.device\_print, which only works with tensor data (no shapes for you) and clogs the output. So I wrote a small tool to run kernels using just torch without changing the code. The only changes is reducing launch grid sizes and changing kernel wrappers to debug wrappers. Here's an example of a simple kernel:

import torch
import triton
# import …

code data debug debugging hey machinelearning small tensor tool torch triton writing

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Scientist

@ ITE Management | New York City, United States