/Tech3h ago

ARC Prize delays verified ARC-AGI evaluations of Anthropic's Fable 5 model over data-retention concerns

Organizers are negotiating safe evaluation terms with Anthropic.

562K67173220.7K

#435

Original post

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

10:28 AM · Jun 9, 2026 · 218.4K Views

/Tech3h ago

ARC Prize delays verified ARC-AGI evaluations of Anthropic's Fable 5 model over data-retention concerns

Organizers are negotiating safe evaluation terms with Anthropic.

562K67173220.7K

#435

Original post

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

10:28 AM · Jun 9, 2026 · 218.4K Views

Sentiment

Positive users thank ARC Prize for opposing Anthropic's data retention terms on Mythos models for privacy reasons, while negative users call the policy ridiculous, a slippery slope, or sabotage.

Pos

47.8%

Neg

52.2%

23 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS11.3KBOOKMARKS17LIKES103RETWEETS3REPLIES3

Gonçalo Canhoto 🇵🇹@goncalo_canhoto

@arcprize Yep, this seems like a big deal for enterprises (?)

1d11.3K10317

Matt Mazur@mhmazur

Here's a link to Anthropic's new support doc about data retention practices for Mythos-class models for anyone curious:

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models

From the intro paragraph:

> To ensure we’re responsibly deploying Mythos-class models, we are requiring limited data retention and review as part of our safety work. Prompts submitted to, and outputs generated by, Mythos-class models are retained for 30 days for trust and safety purposes, on every platform where these models are offered.

1d7.2K2910

Guilherme O'Tina@guilhermeotina

@arcprize ARC is one of the few places left doing independent verification at this level. if their test data cant stay private under mythos terms, the only public fable scores come from anthropics own blog. that shifts what any benchmark claim actually means

19h2.3K27

Casper Hansen@casper_hansen_

@arcprize How should we read this? If anyone ever uses their new model, they automatically have the right to use your data?

22h3.6K131

Conor@jconorgrogan

@arcprize Thank you for standing up against this draconian closed bs

21h1.6K21

Gonçalo Canhoto 🇵🇹@goncalo_canhoto

@1slimewell @arcprize Yep, from my understanding, I think you need to turn on Retention if you want to use it through API

1d53611

maxwell@1slimewell

@goncalo_canhoto @arcprize Holy shit is this including the API?

1d6083

Greg Kamradt@GregKamradt

@casper_hansen_ @arcprize No, that isn't the case per their docs

They explain it well (over 3 docs though) here

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models

21h5766

Frosty40@FrostForger

Anthropic evil. Will sabatoge yoru research. this is THE MOST disgusting day int he history of LLM and humanity. a truly important and revolting day that everyone should be paying attention to. Arcprize is either frontier research, or its not. and if its frontier research, claude will sabotage your project so YOU NEED TO WAKE UP @arcprize and put them on the DO NOT USE list. they have in thier TOS they will SABOTAGE YOUR WORK AND CHARGE YOU. if thats nto the biggest fucking wake up call to the "AGI" world you guys are sleeping at the wheel. Sorry to be so vocal here but YOU NEED TO PICK A LANE NOW. either this is a toy, doesnt matter and we should be making mario, or this is real fuckgin work. the ball is in your court, and it was forced on you by an evil alignment. this is much bigger than any single submission, this is literally our childrens future on the line. you need to see this for what it is, and i assume this post is the first opening salvo in what will be a confirmation of their reprehensable, innapropriate behavior. DO YOU HAVE ANY FUCKGIN CLUE HOW MUCH SHIT I WOULD BE IN SELLING ARCPRIZE MODS THAT SECRETOLY SABOTAGED A USERS ABILITY TO COMPETE? That is literally fraude, and theft, and probably a couple other things. Why do i see this as black and white?

14h1.5K5

David Hendrickson@TeksEdge

@arcprize They said they made it "safe". Did that impact ZDR policies?

1d5.1K3

Steven Sinofsky@stevesi

Here’s another rug pull.

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

2h94840

Trelis Research@TrelisResearch

@arcprize 😢

1d1.2K4

Max For AI@MaxForAI

@arcprize Do you expect his score to improve significantly?

1d3K2

seijin@david_saint_

@arcprize That means never? Because they say they won't train on the data, but you not running it means you don't trust them.

1d2.7K2

kanver@kanver_

@arcprize Will you evaluate current leading Chinese models, such as Qwen 3.7 Max, Kimi K2.6 or Minimax 3 on Arc-AGI-1/2? Would be useful

1d3.2K1

Luci Pars@parsluci

@arcprize

1d1.8K1

Matija Grcic@matijagrcic

@arcprize interesting way to avoid arc benchmark.

22h3642

Eclipse 🌖@ECLresearch

@arcprize Good to see teams pushing for eval integrity instead of chasing PR scores. ARC's data terms for frontier models are a real bottleneck—curious how Anthropic responds on data retention.

1d1K1

Name@canonicalmodel

@arcprize Why does semi-private exist and why isn't that unfair to others?

22h3.1K

Yu Yoshimuta@YuYoshimuta

@arcprize I appreciate your efforts! Looking forward to seeing results on ARC AGI 3

6h5601