/Tech1d ago

Tilde Research releases Parallax, upgrading softmax attention to local-linear regression to fix boundary bias

It matches FlashAttention 3 performance on Hopper hardware

204162822561.1K

#86

Original post

Songlin Yang#249

Tilde@tilderesearch

http://x.com/i/article/2064258509294968832

10:22 AM · Jun 9, 2026 · 33.2K Views

/Tech1d ago

Tilde Research releases Parallax, upgrading softmax attention to local-linear regression to fix boundary bias

It matches FlashAttention 3 performance on Hopper hardware

204162822561.1K

#86

Original post

Songlin Yang#249

Tilde@tilderesearch

http://x.com/i/article/2064258509294968832

10:22 AM · Jun 9, 2026 · 33.2K Views

Sentiment

Positive users praised Tilde Research's Parallax local-linear attention correction and varlen attn plans as based or a win, while negative users called the work crazy and malicious.

Pos

50.0%

Neg

50.0%

4 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS18.8KBOOKMARKS113LIKES164REPLIES6

rohan anil@_arohan_

Tilde folks are cracked. They won’t stop!

1d18.8K164113

RETWEETS10

Zhaoran Wang@zhaoran_wang

open-source is the resistance force against "king wanna bes" like @DarioAmodei, who are trying to monopolize and control future of humanity.

build, share, and rebel!

1d10.4K13728

Yifei Zuo@YifeiZuoX

Thanks @tilderesearch for making this blog post! A few future directions for Parallax I find interesting: - Optimizer: understanding why optimizer interacts so strongly with the Parallax correction, and what that implies for attention more broadly. - Architecture: developing the nonparametric counterpart of DeltaNet, a mechanism sitting between Parallax and LLA. - System: Parallax keeps the structure of standard attention, so it should compose with attention sparsity optimizations. - Post-training: with W_R = 0, Parallax is standard attention, so it can be initialized from a pretrained checkpoint and adapted. I'm curious whether W_R could serve as a steering parameter for RL.

1d2.7K2918

llm_enjoyer@LLMenjoyer

@YifeiZuoX @tilderesearch BTW do you have any plans for future parallax kernel releases? specifically better MFU, varlen, etc?

1d582

josepha.mayo@josepha_mayo

@zhaoran_wang @DarioAmodei exactly kinda stuff i mentioned here

1d241

Yifei Zuo@YifeiZuoX

@LLMenjoyer @tilderesearch yeah, varlen is planned @zz30gs

1d151

Zhichen Zeng@zz30gs

@LLMenjoyer @YifeiZuoX @tilderesearch thanks for your interest. we are planning varlen attn

1d71

llm_enjoyer@LLMenjoyer

@zz30gs @YifeiZuoX @tilderesearch w

1d7

llm_enjoyer@LLMenjoyer

@YifeiZuoX @tilderesearch @zz30gs based

1d7

Zhaoran Wang@zhaoran_wang

@josepha_mayo @DarioAmodei it is literally crazy and malicious

1d1