Joshua Lochner uses Fable 5 to generate WebGPU kernels running Gemma 4 at 255 tokens/sec on Apple M4 · Digg