/AI1d ago

ARC Prize halts private ARC-AGI evaluations of Anthropic's Fable 5 over data retention policy conflicts

Anthropic's Mythos-class models do not support zero-data retention

682.6K84219288.5K

#421

Original post

Seth Lazar#1070

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

10:28 AM · Jun 9, 2026 · 215.9K Views

/AI1d ago

ARC Prize halts private ARC-AGI evaluations of Anthropic's Fable 5 over data retention policy conflicts

Anthropic's Mythos-class models do not support zero-data retention

682.6K84219288.5K

#421

Original post

Seth Lazar#1070

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

10:28 AM · Jun 9, 2026 · 215.9K Views

Sentiment

Positive users praise the ARC Prize for delaying Fable 5 ARC-AGI evaluations to uphold integrity against Anthropic's restrictive data terms, whereas negative users voice disappointment at the delay and criticize the terms as draconian.

Pos

47.2%

Neg

52.8%

14 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS39.6KBOOKMARKS17LIKES171REPLIES6

Greg Kamradt@GregKamradt

We had access to Fable over the past few days

We were able to run it against public data but couldn't do semi-private (our private verification set) due to the new data retention policies

We're working with them to figure out a way to keep verification data private to ensure that ARC-AGI benchmarks continue to give us signal

fwiw, it did well on public data - we'll share once we're able to run semi private

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

1d39.6K17117

RETWEETS8

Mark Saroufim@marksaroufim

The “Frontier lab” label has conveniently expanded to mean almost any team writing software.

An inference startup writing kernels, any sass company building evals, a student researcher working on parallelism

Now they’re all expected to accept being snooped on and nerfed.

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

10h8K16813

Gonçalo Canhoto 🇵🇹@goncalo_canhoto

@arcprize Yep, this seems like a big deal for enterprises (?)

23h11.3K10317

Mike Knoop@mikeknoop

Anthropic's new data retention policies don't mesh well with enterprise users who need zero-data retention (ZDR). ARC also leverages ZDR to run our benchmarks without risk of exposure of private dataset.

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

23h11.9K1189

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Insanely short-termist attitude

ARC Prize@arcprize

We had early access to Anthropic’s Fable 5, but did not run verified Semi-Private ARC-AGI-1/2/3 evals due to their new data-retention terms for Mythos-class models.

We’re working with Anthropic to keep ARC verification data private. Scores will come once we can run them safely.

19h12.1K1019

Matt Mazur@mhmazur

Here's a link to Anthropic's new support doc about data retention practices for Mythos-class models for anyone curious:

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models

From the intro paragraph:

> To ensure we’re responsibly deploying Mythos-class models, we are requiring limited data retention and review as part of our safety work. Prompts submitted to, and outputs generated by, Mythos-class models are retained for 30 days for trust and safety purposes, on every platform where these models are offered.

23h7.2K2910

Guilherme O'Tina@guilhermeotina

@arcprize ARC is one of the few places left doing independent verification at this level. if their test data cant stay private under mythos terms, the only public fable scores come from anthropics own blog. that shifts what any benchmark claim actually means

18h2.3K27

Greg Kamradt@GregKamradt

For previous models we had zero-data retention in place but that isn't the case anymore

I'm confident there is a solution going forward

Greg Kamradt@GregKamradt

We had access to Fable over the past few days

We were able to run it against public data but couldn't do semi-private (our private verification set) due to the new data retention policies

We're working with them to figure out a way to keep verification data private to ensure that ARC-AGI benchmarks continue to give us signal

fwiw, it did well on public data - we'll share once we're able to run semi private

1d2.1K191