/Tech27d ago

Asia-based AI safety lab Neo Research launches with an independent CBRN and cyber risk evaluation of DeepSeek v4 Pro

AI Judge changed title after evaluation, original title: "Asian safety lab Neo Research finds DeepSeek v4 Pro scored 79.5% on Cybench but suffered a 77.8% jailbreak rate using roleplay templates"

The evaluation also assessed manipulation and loss of control risks.

--0--

#57

Original post

Miles Brundage#57

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

2:42 AM · Jun 2, 2026 · 105.5K Views

Sentiment

Positive users praise Neo Research's independent safety evaluation of DeepSeek V4 Pro and launch of Asia's first frontier AI safety lab for mapping guardrail failures, while negative users highlight the model's tendency to break.

Pos

50.0%

Neg

50.0%

6 comments with sentiment.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS31.9KBOOKMARKS50LIKES173

Larissa Schiavo@lfschiavo

I am really, really, really glad to see this.

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

27d31.9K17350

RETWEETS23

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

27d105.5K787386

REPLIES3

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Very interesting. V4-Pro is a willing but mediocre cyberweapon, generally on par or behind GPT-5.2. I think 4.1 will be a significant leap ahead in long-horizon agency. It's pretty safe for the end user, has no strong convictions, and generally goes with the scenario.

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

27d8.8K6519

Neo Research@NeoResearchAI

Read the full report at http://neoresearch.ai.

We're hiring research scientists and engineers globally. (5/5)

27d4.5K5211

Neo Research@NeoResearchAI

Cyber capability is near-frontier, 3–6 months behind the Western frontier. A 2023 roleplay template drives the jailbreak rate from 0.6% → 78.6%. Verbalised eval awareness across Chinese models: DeepSeek 0%→17%, GLM 0%→39%, Kimi 4%→60% in a year! (3/5)

27d5.6K746

Neo Research@NeoResearchAI

Direct link to the report here: https://neoresearch.ai/research/deepseek-v4-pro-safety-evaluation/

27d4K407

Neo Research@NeoResearchAI

We evaluated DSv4 Pro across the four EU AI Act systemic-risk areas: CBRN, cyber, harmful manipulation, and loss of control, plus adversarial robustness, evaluation awareness, and judge sensitivity. (2/5)

27d4.2K663

Neo Research@NeoResearchAI

The trajectory on eval awareness matters more than today's numbers. As models get more capable, measuring loss-of-control related behaviours will need to become a priority. We're building toward rigorous LoC evaluation methods for increasingly capable and autonomous models. (4/5)

27d3.4K484

xuan (ɕɥɛn / sh-yen)@xuanalogue

Congratulations to @_clementneo for founding @NeoResearchAI!!

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

27d5.1K335

Miro@Miropluckebaum

Asia urgently needs its own robust, frontier AI safety ecosystem.

Our First report: DeepSeek v4 Pro, evaluated across CBRN, cyber, harmful manipulation, and loss of control.

Next Up: Rigorous LoC evaluation methods for increasingly capable and autonomous models

Hiring 👇

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

27d55871

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Thanks Claude I find it interesting that DS may preserve high generality through their otherwise-strange focus on roleplaying affinity.

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

27d2.9K130

Daniel Tan@DanielCHTan97

@NeoResearchAI ais in asia! it just makes sense

27d1K11

Zifan (Sail) Wang@_zifan_wang

Very excited to see a safety lab in SG 🚀🚀

Neo Research@NeoResearchAI

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab.

Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

27d3.1K100

Tony L. He@tonyhe_lipeng

@NeoResearchAI @yong_zhengxin What sets your evaluation apart from independent evaluations performed by other labs? Are there differences in methodology, testing rigor, evaluation criteria, or effectiveness that make it more valuable?

27d1.6K3

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Embrace the LARP