19h ago

Former OpenAI and DeepMind builder Phil Chen says future Claude models will let organizations pretrain their own clones

Anthropic will likely deploy automated filters to block model cloning

215301410.3K

——0——

Original post

Thought experiment: if @karpathy's efforts at Anthropic yield a Claude model that is capable of pretraining the next generation of Claude, then any company with sufficient GPU infrastructure could use Claude to pretrain their own Claude-clone. Of course, Anthropic would then ban that company from using Claude. But then wouldn't any company with enough Claude spend be incentivized to use Claude to train their own Claude-clone eventually? What happens in 1-2 years when even open-weights models become good enough to run their own training?

7:24 PM · May 25, 2026

#1243Buck Shlegeris@BSHLGRS

@philhchen @karpathy Note that it's already against Anthropic ToS to use Claude to develop competing products, which definitely covers this.

Anthropic hasn't yet put filters in place to reliably prevent this, but it seems reasonably likely they'll do so soon.

Phil Chen@philhchen

2:24 AM · May 26, 2026 · 9.7K Views

5:10 PM · May 26, 2026 · 408 Views

#1243Buck Shlegeris@BSHLGRS

@philhchen @karpathy Idk how they will/should draw the line. You could operationalize as "Claude won't help you train models that will be within a factor of 10x of cost competitiveness of any currently deployed Anthropic model"?

Phil Chen@philhchen

@bshlgrs @karpathy where do you draw the line between nanoGPT runs on 8xH100s (obviously allowed) and big pretrain on 100k B200s?

5:26 PM · May 26, 2026 · 78 Views

5:49 PM · May 26, 2026 · 53 Views

#1243Buck Shlegeris@BSHLGRS

@philhchen @karpathy Yeah that would definitely be disallowed under the rule I proposed.

Definitely there's some tricky question for Anthropic about how to manage existing relationships with counterparties who use Claude for their AI work.

Phil Chen@philhchen

@bshlgrs @karpathy actually a clear counterexample to this would be Google DeepMind using Opus for Gemini pretraining code

6:35 PM · May 26, 2026 · 32 Views

6:43 PM · May 26, 2026 · 32 Views

#1468Phil Chen@PHILHCHEN

@bshlgrs @karpathy where do you draw the line between nanoGPT runs on 8xH100s (obviously allowed) and big pretrain on 100k B200s?

Buck Shlegeris@bshlgrs

@philhchen @karpathy Note that it's already against Anthropic ToS to use Claude to develop competing products, which definitely covers this. Anthropic hasn't yet put filters in place to reliably prevent this, but it seems reasonably likely they'll do so soon.

5:10 PM · May 26, 2026 · 408 Views

5:26 PM · May 26, 2026 · 78 Views

#1468Phil Chen@PHILHCHEN

@bshlgrs @karpathy actually a clear counterexample to this would be Google DeepMind using Opus for Gemini pretraining code

Buck Shlegeris@bshlgrs

5:49 PM · May 26, 2026 · 53 Views

6:35 PM · May 26, 2026 · 32 Views

Former OpenAI and DeepMind builder Phil Chen says future Claude models will let organizations pretrain their own clones

Sentiment

Cluster engagement