/Tech3h ago

Researcher Pushes Back On Information Bottleneck Framing For Residual Streams

3600261

Original post

Now for the Information Bottleneck framing. I can't believe I'm the one pushing back on this part 😫, and even though I personally agree with the intuition (the residual stream navigating an IB tradeoff in depth is a beautiful way to think about it) - it's hand-wavy.

Ravid Shwartz Ziv@ziv_ravid

And the "at scale" part is the underrated contribution. All L layers still run, with no truncation — which means KV cache / continuous batching / tensor parallelism stay untouched. They only re-route which layer feeds the sampler.

11:25 AM · Jun 23, 2026 · 45 Views

Sentiment

Users praise the research pushing back on information bottleneck framing for residual streams as great work and thank the authors while anticipating future collaborations.

Pos

100.0%

Neg

0.0%

3 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

Ravid Shwartz Ziv@ziv_ravid

For example, in our attention-sinks work we showed which kinds of tasks benefit from intermediate vs. final-layer representations (embedding/compression-heavy tasks vs. generation). That task-dependence is the missing axis here

Ravid Shwartz Ziv@ziv_ravid

Entropy is not information. The whole method reads predictive entropy off the logit lens and treats the valley as the "representational zenith." But an approximation of the entropy of the output distribution ≠ the mutual information that the IB story actually invokes. The real question, and it's a very hard one, is how to measure the information.

3h17320

LIKES3

Ravid Shwartz Ziv@ziv_ravid

Anyway, great work @xuanmingzhangai Go to read it!

3h993

REPLIES1

Ravid Shwartz Ziv@ziv_ravid

3h4320

xuanming zhang@xuanmingzhangai

@ziv_ravid Sincerely thanks for the reading and all the valuable thoughts! Looking forward to have exciting conversations or cooperations in the future!

3h51

Ravid Shwartz Ziv@ziv_ravid

@xuanmingzhangai Thabk you!

3h6