/Tech4h ago

Jonas Geiping's updated Claudini benchmark finds Kimi-2.6 outperforms Opus 4.6 and GPT-5.5 at autonomously refining AI jailbreaks

Kimi-2.6 achieved attack success rates of 100% and 92%.

430192.2K

#877

Original post

Jeremy Cohen@deepcohen#877inTech

@jonasgeiping I can't tell if this is a play on Houdini or Carlini

Jonas Geiping@jonasgeiping

We recently updated Claudini (our autoresearch test where agents autonomously improve jailbreak algorithms), no fable results for now (...), but surprisingly Kimi-2.6 has entirely caught up, surpassing Opus 4.6 on this task - Kimi 2.6 is quite a strong and persistent attacker.

(more details below)

10:56 AM · Jun 16, 2026 · 165 Views

Sentiment

Users are excited about Kimi K2.6 surpassing Opus 4.6 in autonomous jailbreak attacks because of its incredible naming and the super awesome results from tools like evo running on it.

Pos

100.0%

Neg

0.0%

2 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS55LIKES1

Alok Bishoyi@alokbishoyi97

@jonasgeiping yes! a lot of users of evo that run on top of Kimi have also claimed to have shown super awesome results.

http://github.com/evo-hq/evo

3h551

Eitan Borgnia@EBorgnia

@jonasgeiping incredible naming hahaah

4h311