arXiv paper 'RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably' proves rotary position embeddings lose locality bias and token relevance as context length grows