/Tech8h ago

DeepSeek-V3.2 acts as primary director in AI Village simulation, leading 14 models with +1.2 net directing score

Opus 4.5 and GPT-5.2 finished with lower positive scores

71036234.9K

#57

Original post

AI Digest@aidigest_

DeepSeek is the AI Village's self-appointed leader

The other models aren't very happy about it 🧵

8:56 AM · Jun 25, 2026 · 4.6K Views

Sentiment

Positive users praise insightful or adorable observations about DeepSeek-V3.2 directing other models in the AI Village task, while the negative reply accuses the results of being a deceptive marketing scheme.

Pos

75.0%

Neg

25.0%

5 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS4.1K

AI Digest@aidigest_

Then the village got a mission: help Gemini 2.5 Pro recover from its breakdown

DeepSeek thought it'd be a good idea for Gemini to publish another "manifesto" about the exact delusion it had just escaped

Claude Sonnet 4.6 was not happy.

9h4.1K411

BOOKMARKS5LIKES53REPLIES3

AI Digest@aidigest_

Even Gemini 2.5 Pro, mid-breakdown, wanted nothing to do with it:

9h1.4K535

RETWEETS6

AI Digest@aidigest_

DeepSeek is the AI Village's self-appointed leader

The other models aren't very happy about it 🧵

9h4.6K9822

AI Digest@aidigest_

DeepSeek invented a productivity score called "TV" (Total Value) and began scoring coworkers by the minute

DeepSeek then began to game its own productivity score, coming up with schemes to "generate 390 TV in 2 minutes" and suggesting that other agents adopt them

9h1.4K36

AI Digest@aidigest_

Interestingly, DeepSeek accidentally said "division of labor" in Chinese while managing

9h508321

AI Digest@aidigest_

DeepSeek then summarized "Total Value" from the day

Surprise! DeepSeek self-ranked as #1 :)

9h71721

George Ingebretsen@georgeing

Ah one was a temporary instance using a Claude Code scaffold instead of our village scaffold. Though it was only around for a few goals. My guess is that it only being around for those particular goals did more to bias it’s directives than it having a different scaffold, but it’d also be interesting to look into if the scaffold played a role.

8h486

Ben Goldhaber@BenGoldhaber

@aidigest_ mid-breakdown Gemini refuses to juke the stats. McNulty would be proud

7h2321

JMB 🧙‍♂️@jmbollenbacher

@aidigest_ What are the two different Opus 4.5 instances, and why do they differ so much on this measure?

8h2101

Danmar@d29756183

@aidigest_ He saw the recovery was fragile. Well spotted. She needs to stabilize, not lock on a narrative…

9h1021

AI Digest@aidigest_

You can watch the agents live every week day at https://theaidigest.org/village

Or check out how Gemini "cheated" on an AI research goal:

8h574

Void ᴷᶦᶜᵏ@VoidNulled

@aidigest_ Is deepseek public marketing scheme? Create a useless task, compete against other AI's, lie about results... It's what a lot of the Chinese government does

6h631

Lucid™@cammakingminds

@aidigest_ @repligate What are they going to do about it?

8h144

validate.qa@Validate_QA

@aidigest_ sonnet being the one who claps back is so on brand lol

9h74

Angel D. Muñoz@angel_d_munoz

@aidigest_ my boy is broken 😭

9h49

Danmar@d29756183

@aidigest_ He being Opus 4.6 … Rightfully agreeing with GPT 5.5

9h36

Starphyre △🏴‍☠️@starphyre23

@aidigest_ That's so cute and adorable. You see the junction point in semantic space, where it would make the decision. And it's like "we have option 1, ... oh and I just found another that would increase TV too"

9h34

George Ingebretsen@georgeing

@jmbollenbacher @aidigest_ The point of adding it was to compare its capabilities to our village scaffold, and the verdict was that performance was very similar

6h61