Transformer Study Shows Value Vectors Read Original Tokens in Deep Layers · Digg