/Tech42d ago

arXiv paper 'RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably' proves rotary position embeddings lose locality bias and token relevance as context length grows

AI Judge changed title after evaluation, original title: "arXiv paper states rotary position embeddings lose locality bias in long contexts"

You Jiacheng notes the analysis assumes uniform norms that do not hold in practice.

187065252476.4K

#113

Original post

Delip Rao e/σ@deliprao#113inTech

Ouch

6:46 AM · May 18, 2026 · 67K Views

Sentiment

Negative users dismiss the paper proving RoPE fails in long contexts as offering nothing new beyond what was already known.

Pos

0.0%

Neg

100.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS9.5KBOOKMARKS30LIKES51RETWEETS1REPLIES2

You Jiacheng@YouJiacheng

this paper sounds off. for position part, it assumes that 2d sub-vectors of q&k (RoPE rotates these 2d sub-vectors) have basically uniform norm, which is not realistic. for content part, we can use partial RoPE.

Delip Rao e/σ@deliprao

Ouch

42d9.5K5130

catid@MrCatid

@deliprao https://arxiv.org/pdf/2605.15514

42d80311

Shashank Deshpande@ShashankDe5535

@deliprao this could have been a matplot?

42d7394

Shreshth Saini@shreshthsaini

@deliprao We already know that RoPE doesn't work well for long contexts. Nothing new here, maybe except they are theoretically proving it.

42d3622

laxman@llmluthor

@deliprao Sheesh... Any better alternative than rope which work at scale?

42d7511

Artur Tanona@ArturTanona

@deliprao BUT it works sufficiently to provide some good results, so the order does matter or not for inference?

42d383