MakoGenerate: AI-Powered GPU Kernel Generation in Under 60 Seconds

atallahw · 2025-06-25T15:17:13 1750864633

This was fun to work on. LLMs for writing kernels still has a long way to go. Its honestly a little surprising how decent they are now. I guess I've been pretty consistently "surprised" by codegen for a while now (meaning the last two years)

mohsaied · 2025-06-25T15:14:46 1750864486

This is the first step towards fully automated GPU performance optimization. The idea is to automatically generate GPU kernels, then automatically integrate them in vLLM/SGLang/PyTorch.

essamwisam · 2025-06-25T15:19:16 1750864756

Quite cool. It's interesting that the LLM is able to optimize code based on the target hardware itself.