/Tech11h ago

Cognition launches Devin Security Swarm with an Agentic MapReduce architecture to find and verify code vulnerabilities

The system runs specialized agents in sandboxes to verify findings.

342312016050.2K

#195

Original post

andrew gao@itsandrewgao#1852inTech

👏👏 @ido_pesok

Cognition@cognition

Introducing Devin Security Swarm

A more cost effective and accurate way to find security vulnerabilities in complex codebases, based on a new architecture: Agentic MapReduce.

1:10 PM · Jul 1, 2026 · 3.7K Views

Sentiment

Positive users praise the Agentic MapReduce framing in Devin Security Swarm for enabling orchestrated agent swarms with verification, while negative users question its sustainability as AI merely fixing vulnerabilities created by other AI.

Pos

66.7%

Neg

33.3%

3 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS36.6KBOOKMARKS136LIKES156RETWEETS16REPLIES24

Aaron Levie@levie

If you’ve ever wondered why we will need 100X more AI inference in the future, and what it’s going to be driven by, this is another good example.

Devin pushes forward an idea of agentic mapreduce, which means we’ll now have swarms of agents that are processing large amounts of data (code) to handle tasks that humans never could have done before.

“Devin maps relevant signals across the repo, fans out focused agents over bounded shards, reduces their findings into one report, then verifies serious vulnerabilities in isolated sandboxes before marking them confirmed.”

In this case it’s code security, but there are tons of other use-cases in code and knowledge work. We see this at Box with customers that want to process and understand millions of documents for risk, insights, relationships, and more. This will play out in pharma, banking, and many other industries across all forms of unstructured data.

As an aside, these types of capabilities are generally only possible when you can deploy a variety of models (both the frontier and lower cost) because of the sheer amount of tokens that go into these use-cases. This is going to be a major value proposition for the applied AI layer.

Cognition@cognition

Introducing Devin Security Swarm

A more cost effective and accurate way to find security vulnerabilities in complex codebases, based on a new architecture: Agentic MapReduce.

4h36.6K156136

alex zhang@a1zhang

i wonder if the LM had a mechanism to launch agentic mapreduce and maybe even just general patterns

Cognition@cognition

Introducing Devin Security Swarm

A more cost effective and accurate way to find security vulnerabilities in complex codebases, based on a new architecture: Agentic MapReduce.

2h3.2K226

Jared Zoneraich@imjaredz

@levie Great points. As agentic mapreduce becomes more common in other domains (already seeing it internally for model research) it will put more cost pressure on the industry.. which in my opinion is a good thing and will lead to great innovations like Devin Fusion

3h851

Ehsan Azish@ehzish

@levie tokens go brrr i guess

4h351

Dylan from 2045@dylan2045ad

This is the part Jensen's five layer cake leaves out. He models land, chips, infrastructure, and models, but swarms of agents burning tokens on mapreduce-style workflows live entirely in the application layer he says NVIDIA won't build. That layer is where the actual demand for the 100X comes from.

4h56

KP bhoomika@buildwbhoomika

@levie Interesting

3h40

Somi AI@somi_ai

@levie the map half is easy. it's the reduce I haven't seen anyone do well yet, merging hundreds of partial agent answers without the errors compounding

3h39

Raven@heyraven_io

@levie 100x more inference to process the documents explaining why you need 100x more inference

2h38

stef 🌸@stefannycrypto

@levie @dabit3 Gak sanggup ldr

3h37

Friendly Neighborhood Shaman@kaelcorwin

@levie i can already feel my ass getting clenched by the ethereal gust from all those servers

4h37

Zachary.ETH@Zachary14818302

@levie compute is the new oil

3h20

Mark Hill@marksmacncheese

@levie So the thing that keeps the cap ex flow going is AI finding and perhaps partially fixing dangerous junk some other AI made. At some point people will no longer pay for that. We’re not totally dumb.

2h19

Drayx@Drayxhq

@levie Agentic mapreduce is the right framing — we’re moving from ‘one agent, one task’ to orchestrated swarms with verification built in. The sandbox confirmation step is the key detail most people will skip over.

4h19

Abel@Abel_Noumi

@levie Same pattern we've hit building our own agent fleet: the hard part was never getting one agent to work well. It's decomposing a fuzzy problem into bounded shards an agent can verify on its own — then trusting the reduce step enough to act on it.

3h8

Issam Hakimi@killix

@levie The fan-out that 100X's your inference does the same to your ungoverned action count. One task becomes hundreds of shard reads, edits, and test runs, none of them gated. Only one of those two numbers shows up on the invoice.

2h7

Bruce Martin@realbrucemartin

@levie agentic mapreduce is such a clean name, and somehow terrifyingly obvious in hindsight. the token bill is about to develop a personality

1h5

Lorenzo@gabor_rar

@levie Been running the manual version at ProposalPilot scale for a year: bounded sessions, task-scoped doc index of 5-10 files, proof notes so the next agent reads outcomes instead of raw history. Devin's benchmark is the first time the pattern gets measured in the open.

3h5

未知@luyun0120

@levie Devin的代理式MapReduce确实点明了AI推理需求暴增的核心：从单线程编程到并行智能体集群，推理成本将成倍增长。

3h5

Healthy Anon@arimedai

@a1zhang The general pattern question is the right one. Map works fine. The reduce kills you. Synthesizing parallel agent findings needs more than concatenation.