/Tech14h ago

NF-CoT uses normalizing flows to compress 385 chain-of-thought tokens into 64 continuous latent reasoning tokens

The approach remains fully compatible with standard KV-cache decoding.

102965122314.9K

#99

Original post

alphaXiv@askalphaxiv

"Latent Reasoning with Normalizing Flows"

NF-CoT makes latent reasoning feel native to LLMs. So instead of forcing every intermediate thought through verbose CoT text, it learns compact continuous thoughts with a normalizing flow inside the causal LLM stream.

The key move is that latent thoughts become sampleable, scoreable, and RL-trainable like tokens, with exact likelihoods and KV-cache friendly decoding.

This beats explicit CoT and prior latent methods, while using 64 latent tokens to compress roughly 385 CoT tokens and running much faster than diffusion-based latent reasoning.

9:54 AM · Jun 8, 2026 · 14.9K Views

/Tech14h ago

NF-CoT uses normalizing flows to compress 385 chain-of-thought tokens into 64 continuous latent reasoning tokens

The approach remains fully compatible with standard KV-cache decoding.

102965122314.9K

#99

Original post

alphaXiv@askalphaxiv

"Latent Reasoning with Normalizing Flows"

The key move is that latent thoughts become sampleable, scoreable, and RL-trainable like tokens, with exact likelihoods and KV-cache friendly decoding.

This beats explicit CoT and prior latent methods, while using 64 latent tokens to compress roughly 385 CoT tokens and running much faster than diffusion-based latent reasoning.

9:54 AM · Jun 8, 2026 · 14.9K Views

Sentiment

Users praise NF-CoT for using Normalizing Flows to compress chain-of-thought reasoning into far fewer latents than diffusion methods, calling the approach cleaner, KV-cache compatible, and promising.

Pos

100.0%

Neg

0.0%

5 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS1.3KBOOKMARKS6LIKES7

alphaXiv@askalphaxiv

read more: https://www.alphaxiv.org/abs/2606.06447

1d1.3K76

RETWEETS1

Nicholas Hyperion@nickhistgeek

@askalphaxiv 385 CoT tokens down to 64 latents. NF with exact likelihoods is way cleaner than diffusion for latent reasoning. KV-cache compatible too.

14h1482

REPLIES1

Gregor@bygregorr

@askalphaxiv not sure 'forcing' is the right word for verbose CoT those text tokens are also the audit trail. if reasoning moves fully into continuous latent space, how do you even inspect where it went wrong?

1d232

Kye Gomez (swarms)@KyeGomezB

@askalphaxiv Great paper

1d74511

Peter Schmidinger@PeteSchmidinger

@askalphaxiv NF-CoT is currently third on alphaXiv hot / trending papers👍@Guancheng_Tu @EthanFu0355525 @haoqik322 @thoma_gu

9h925

YoungSeong Kim@salam341353

@askalphaxiv Looks cool, leveraging Normalizing Flows allows the model to internalize CoT more efficiently than previous diffusion-based approaches. That said, the training complexity looks like a significant bottleneck. Hoping to see future work that addresses this.

14h83

That AI Guy@LewisWeldtech

@askalphaxiv Your welcome https://www.academia.edu/168407565/Foundations_of_Structural_Survivability_From_Compressive_Multi_Projection_Homology_to_Neuromodulatory_Autonomous_Control?source=swp_share

16h68

Guancheng Tu@Guancheng_Tu

Great question! In fact our method can perfectly solve this. Since we uses normalizing flows to model the continuous thoughts, the information is preserved, and they can be inverted and decided into readable text for inspection if we want. we also made a cool project page (https://nf-cot.vercel.app) with examples where the latent reasoning traces are decoded back into readable text for inspection. These examples show how different latent samples can lead to quite different implementation strategies, while still producing functionally correct programs. So latent reasoning does not have to mean giving up interpretability—we can still inspect and debug the sampled reasoning paths when needed. Would love for you to check it out!

18h16

That AI Guy@LewisWeldtech

@askalphaxiv https://www.academia.edu/168355445/The_Cubic_Manifold_Projection_Hypothesis_A_Methodology_for_Structural_Information_Management_CMPH_c_2025?source=swp_share

16h7

That AI Guy@LewisWeldtech

@askalphaxiv https://www.academia.edu/167718334/The_Unified_Agent_Harness_Neuromodulatory_Control_with_Cayley_Unitary_Adapters_for_Autonomous_AI_Research?source=swp_share

16h7