171/365 of GPU Programming
Finally got below 2 ms on the GPU Mode QR challenge. Still quite far away from CUDA colonel @blelbach, so I'm excited to read everyone's writeups once the competition is over.
I especially wonder how #1 is utilizing NVFP4 (or is the submission name a false flag?). Unfortunately, none of my lower-precision attempts have panned out so far.
Very curious where the top 3 will converge a week from now...
If you're still debating whether to participate, I'd highly recommend it! You learn so much from just trying stuff and being in the Discord. And @modal gives you $30 in monthly compute for free. Maybe we can get a bit more compute for the next challenge if we all ask @charles_irl nicely 😁😁
170/365 of GPU Programming
Just going through the motions of QR factorization and Gram-Schmidt today while reviewing concepts around orthogonality, norms, and projections.
It's been a while since I studied Linear Algebra, so it's been fun reviewing some older concepts and learning new ones as well.
Will spend some time learning more about Cholesky and Householder as well tomorrow.
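For anyone following along, here's a minimal sketch of how Gram-Schmidt ties those ideas together: each column gets its projections onto the earlier orthonormal vectors subtracted (orthogonality), then gets divided by its norm. This is the modified Gram-Schmidt variant in plain Python, assuming a matrix with full column rank stored as a list of rows. The function name `gram_schmidt_qr` and the example matrix are made up for illustration, and this is nothing like what a fast GPU submission would look like.

```python
import math

def gram_schmidt_qr(A):
    """Modified Gram-Schmidt QR of an m x n matrix given as a list of rows.

    Assumes full column rank (otherwise a norm is zero and the division fails).
    Returns Q (m x n, orthonormal columns) and R (n x n, upper triangular).
    """
    m, n = len(A), len(A[0])
    q_cols = []                          # orthonormal columns built so far
    R = [[0.0] * n for _ in range(n)]
    for j in range(n):
        v = [A[i][j] for i in range(m)]  # column j of A
        for k in range(j):
            # Projection coefficient of the current residual onto q_k
            R[k][j] = sum(q * x for q, x in zip(q_cols[k], v))
            # Subtract the projection so v becomes orthogonal to q_k
            v = [x - R[k][j] * q for x, q in zip(v, q_cols[k])]
        # What is left gets normalized; its norm is the diagonal entry of R
        R[j][j] = math.sqrt(sum(x * x for x in v))
        q_cols.append([x / R[j][j] for x in v])
    # Convert the list of columns back to a list of rows
    Q = [[q_cols[j][i] for j in range(n)] for i in range(m)]
    return Q, R

# Hypothetical 3 x 2 example matrix
A = [[1.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
Q, R = gram_schmidt_qr(A)
```

Multiplying `Q` by `R` should reproduce `A` up to floating-point error, and the columns of `Q` should have unit norm and zero dot product with each other.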



