/AI8h ago

LSTM co-developer Sepp Hochreiter highlights Reasoning in Memory, a technique that runs LLM reasoning entirely within latent spaces

Bypassing autoregressive token overhead significantly speeds up LLM inference.

--0--
Original post
Sepp Hochreiter@HochreiterSepp#526inAI

Reasoning in Memory (RiM) enables latent reasoning without the need to "think out loud." By reasoning directly within a dedicated latent workspace—working memory—no overhead of generating explicit reasoning tokens. Dramatically faster inference with the same quality of reasoning.

8:58 AM · Jun 1, 2026 · 14.8K Views
Sentiment
Sentiment unavailable for this story.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
VIEWS774BOOKMARKS3LIKES5RETWEETS1REPLIES1
Cody Blakeney@code_star

Do we think this will make the shape rotators happy because reasoning will be done without an inner monologue?

Sepp Hochreiter@HochreiterSepp

Reasoning in Memory (RiM) enables latent reasoning without the need to "think out loud." By reasoning directly within a dedicated latent workspace—working memory—no overhead of generating explicit reasoning tokens. Dramatically faster inference with the same quality of reasoning.

5hViews 774Likes 5Bookmarks 3