/Tech3h ago

ARC Prize delays verified ARC-AGI evaluations of Anthropic's Fable 5 model over data-retention concerns

Organizers are negotiating safe evaluation terms with Anthropic.

562K67173220.7K
Original post
ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

10:28 AM · Jun 9, 2026 · 218.4K Views
Sentiment

Positive users thank ARC Prize for opposing Anthropic's data retention terms on Mythos models for privacy reasons, while negative users call the policy ridiculous, a slippery slope, or sabotage.

Pos
47.8%
Neg
52.2%
23 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS11.3KBOOKMARKS17LIKES103RETWEETS3REPLIES3

@arcprize Yep, this seems like a big deal for enterprises (?)

1dViews 11.3KLikes 103Bookmarks 17
Matt Mazur@mhmazur

Here's a link to Anthropic's new support doc about data retention practices for Mythos-class models for anyone curious:

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models

From the intro paragraph:

> To ensure we’re responsibly deploying Mythos-class models, we are requiring limited data retention and review as part of our safety work. Prompts submitted to, and outputs generated by, Mythos-class models are retained for 30 days for trust and safety purposes, on every platform where these models are offered.

1dViews 7.2KLikes 29Bookmarks 10
Guilherme O'Tina@guilhermeotina

@arcprize ARC is one of the few places left doing independent verification at this level. if their test data cant stay private under mythos terms, the only public fable scores come from anthropics own blog. that shifts what any benchmark claim actually means

19hViews 2.3KLikes 27
Casper Hansen@casper_hansen_

@arcprize How should we read this? If anyone ever uses their new model, they automatically have the right to use your data?

22hViews 3.6KLikes 13Bookmarks 1
Conor@jconorgrogan

@arcprize Thank you for standing up against this draconian closed bs

21hViews 1.6KLikes 21

@1slimewell @arcprize Yep, from my understanding, I think you need to turn on Retention if you want to use it through API

1dViews 536Likes 1Bookmarks 1
maxwell@1slimewell

@goncalo_canhoto @arcprize Holy shit is this including the API?

1dViews 608Likes 3
Greg Kamradt@GregKamradt

@casper_hansen_ @arcprize No, that isn't the case per their docs

They explain it well (over 3 docs though) here

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models

21hViews 576Likes 6
Frosty40@FrostForger

Anthropic evil. Will sabatoge yoru research. this is THE MOST disgusting day int he history of LLM and humanity. a truly important and revolting day that everyone should be paying attention to. Arcprize is either frontier research, or its not. and if its frontier research, claude will sabotage your project so YOU NEED TO WAKE UP @arcprize and put them on the DO NOT USE list. they have in thier TOS they will SABOTAGE YOUR WORK AND CHARGE YOU. if thats nto the biggest fucking wake up call to the "AGI" world you guys are sleeping at the wheel. Sorry to be so vocal here but YOU NEED TO PICK A LANE NOW. either this is a toy, doesnt matter and we should be making mario, or this is real fuckgin work. the ball is in your court, and it was forced on you by an evil alignment. this is much bigger than any single submission, this is literally our childrens future on the line. you need to see this for what it is, and i assume this post is the first opening salvo in what will be a confirmation of their reprehensable, innapropriate behavior. DO YOU HAVE ANY FUCKGIN CLUE HOW MUCH SHIT I WOULD BE IN SELLING ARCPRIZE MODS THAT SECRETOLY SABOTAGED A USERS ABILITY TO COMPETE? That is literally fraude, and theft, and probably a couple other things. Why do i see this as black and white?

14hViews 1.5KLikes 5

@arcprize They said they made it "safe". Did that impact ZDR policies?

1dViews 5.1KLikes 3

Here’s another rug pull.

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

2hViews 948Likes 4Bookmarks 0
Max For AI@MaxForAI

@arcprize Do you expect his score to improve significantly?

1dViews 3KLikes 2
seijin@david_saint_

@arcprize That means never? Because they say they won't train on the data, but you not running it means you don't trust them.

1dViews 2.7KLikes 2
kanver@kanver_

@arcprize Will you evaluate current leading Chinese models, such as Qwen 3.7 Max, Kimi K2.6 or Minimax 3 on Arc-AGI-1/2? Would be useful

1dViews 3.2KLikes 1
Luci Pars@parsluci

@arcprize

1dViews 1.8KLikes 1
Matija Grcic@matijagrcic

@arcprize interesting way to avoid arc benchmark.

22hViews 364Likes 2
Eclipse 🌖@ECLresearch

@arcprize Good to see teams pushing for eval integrity instead of chasing PR scores. ARC's data terms for frontier models are a real bottleneck—curious how Anthropic responds on data retention.

1dViews 1KLikes 1
Name@canonicalmodel

@arcprize Why does semi-private exist and why isn't that unfair to others?

22hViews 3.1K
Yu Yoshimuta@YuYoshimuta

@arcprize I appreciate your efforts! Looking forward to seeing results on ARC AGI 3

6hViews 560Likes 1
Load more posts