/Tech1h ago

Yuzhen Mao's team releases DeLM, a decentralized multi-agent framework achieving 65.7% on SWE-bench Verified for $0.12 per task

The framework coordinates parallel agents without a central controller.

9927428.3K
Original post
Azalia Mirhoseini@Azaliamirh#210inTech

Introducing Decentralized Language Models (DeLM)!

DeLM is a multi-agent framework that enables asynchronous, verified & reusable progress!

It makes agentic tasks more accurate and significantly cheaper. For example, it achieves 65.7% on SWE-bench Verified using Gemini 3-Flash, a ~10% jump over the best centralized alternatives at less than half the cost.

Great work led by @Mao_Yuzhen !

1:41 PM · Jun 10, 2026 · 5.6K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS4.5KBOOKMARKS37LIKES48RETWEETS4REPLIES5
alex zhang@a1zhang

wait this is so cool LOL

in theory if we hillclimb RLMs maybe they become incentivized to launch code blocks in this way

49mViews 4.5KLikes 48Bookmarks 37

At a high level, here is how DeLM differs from a centralized framework:

Introducing Decentralized Language Models (DeLM)!

DeLM is a multi-agent framework that enables asynchronous, verified & reusable progress!

It makes agentic tasks more accurate and significantly cheaper. For example, it achieves 65.7% on SWE-bench Verified using Gemini 3-Flash, a ~10% jump over the best centralized alternatives at less than half the cost.

Great work led by @Mao_Yuzhen !

1hViews 332Likes 4Bookmarks 2
Omar Khattab@lateinteraction

@a1zhang it’s certainly one RLM strategy 👀

alex zhang@a1zhang

wait this is so cool LOL

in theory if we hillclimb RLMs maybe they become incentivized to launch code blocks in this way

45mViews 553Likes 7Bookmarks 1

DeLM in action!

At a high level, here is how DeLM differs from a centralized framework:

1hViews 417Likes 6Bookmarks 1

You can read more about DeLM and try it yourself:

Paper: https://arxiv.org/abs/2606.10662 Website: https://yuzhenmao.github.io/DeLM/

DeLM in action!

1hViews 189Likes 3Bookmarks 1
Hunter Bown@goodhunt

@a1zhang Have you tried the muse spark contemplating mode? I can’t tell what’s going on in the backend from what they show but it looks very interesting

46mViews 55Likes 1Bookmarks 1
Drew Breunig@dbreunig

@lateinteraction @a1zhang Swarlm!

43mViews 56
Adam Elkassas@adampredev

@a1zhang @lateinteraction @predotdev

40mViews 10
Max Headroom@CosmicMonad

@lateinteraction @a1zhang What are some others? How would you rank them?

33mViews 4