19h ago

Former OpenAI and DeepMind builder Phil Chen says future Claude models will let organizations pretrain their own clones

Anthropic will likely deploy automated filters to block model cloning

0
Original post

Thought experiment: if @karpathy's efforts at Anthropic yield a Claude model that is capable of pretraining the next generation of Claude, then any company with sufficient GPU infrastructure could use Claude to pretrain their own Claude-clone. Of course, Anthropic would then ban that company from using Claude. But then wouldn't any company with enough Claude spend be incentivized to use Claude to train their own Claude-clone eventually? What happens in 1-2 years when even open-weights models become good enough to run their own training?

7:24 PM · May 25, 2026 View on X

@philhchen @karpathy Note that it's already against Anthropic ToS to use Claude to develop competing products, which definitely covers this.

Anthropic hasn't yet put filters in place to reliably prevent this, but it seems reasonably likely they'll do so soon.

Phil ChenPhil Chen@philhchen

Thought experiment: if @karpathy's efforts at Anthropic yield a Claude model that is capable of pretraining the next generation of Claude, then any company with sufficient GPU infrastructure could use Claude to pretrain their own Claude-clone. Of course, Anthropic would then ban that company from using Claude. But then wouldn't any company with enough Claude spend be incentivized to use Claude to train their own Claude-clone eventually? What happens in 1-2 years when even open-weights models become good enough to run their own training?

2:24 AM · May 26, 2026 · 9.7K Views
5:10 PM · May 26, 2026 · 408 Views

@philhchen @karpathy Idk how they will/should draw the line. You could operationalize as "Claude won't help you train models that will be within a factor of 10x of cost competitiveness of any currently deployed Anthropic model"?

Phil ChenPhil Chen@philhchen

@bshlgrs @karpathy where do you draw the line between nanoGPT runs on 8xH100s (obviously allowed) and big pretrain on 100k B200s?

5:26 PM · May 26, 2026 · 78 Views
5:49 PM · May 26, 2026 · 53 Views

@philhchen @karpathy Yeah that would definitely be disallowed under the rule I proposed.

Definitely there's some tricky question for Anthropic about how to manage existing relationships with counterparties who use Claude for their AI work.

Phil ChenPhil Chen@philhchen

@bshlgrs @karpathy actually a clear counterexample to this would be Google DeepMind using Opus for Gemini pretraining code

6:35 PM · May 26, 2026 · 32 Views
6:43 PM · May 26, 2026 · 32 Views

@bshlgrs @karpathy where do you draw the line between nanoGPT runs on 8xH100s (obviously allowed) and big pretrain on 100k B200s?

Buck ShlegerisBuck Shlegeris@bshlgrs

@philhchen @karpathy Note that it's already against Anthropic ToS to use Claude to develop competing products, which definitely covers this. Anthropic hasn't yet put filters in place to reliably prevent this, but it seems reasonably likely they'll do so soon.

5:10 PM · May 26, 2026 · 408 Views
5:26 PM · May 26, 2026 · 78 Views

@bshlgrs @karpathy actually a clear counterexample to this would be Google DeepMind using Opus for Gemini pretraining code

Buck ShlegerisBuck Shlegeris@bshlgrs

@philhchen @karpathy Idk how they will/should draw the line. You could operationalize as "Claude won't help you train models that will be within a factor of 10x of cost competitiveness of any currently deployed Anthropic model"?

5:49 PM · May 26, 2026 · 53 Views
6:35 PM · May 26, 2026 · 32 Views
Former OpenAI and DeepMind builder Phil Chen says future Claude models will let organizations pretrain their own clones · Digg