7h agoYifei Zuo releases Parallax, a local linear attention architecture that beats standard attention when paired with the Muon optimizerA custom decode kernel matches or exceeds FlashAttention performance.