/AI5h ago

AVERI's Miles Brundage criticizes OpenAI for silent A/B testing on frontier models, warning it harms reproducibility of safety research

The critique followed reports of query nerfing on Fable

740002.9K

#20

Original post

Miles Brundage@Miles_Brundage#20inAI

Prompted by the Fable "nerfing on frontier AI development related queries" stuff but the point is more general...

I have criticized OAI many times for silent A/B testing, which I think is inappropriate for such a critical technology

Miles Brundage@Miles_Brundage

I tentatively think that silent model switching is never a good idea.

It's horrible for research (including safety research), among many other effects

12:09 PM · Jun 9, 2026 · 1.5K Views

/AI5h ago

AVERI's Miles Brundage criticizes OpenAI for silent A/B testing on frontier models, warning it harms reproducibility of safety research

The critique followed reports of query nerfing on Fable

740002.9K

#20

Original post

Miles Brundage@Miles_Brundage#20inAI

Prompted by the Fable "nerfing on frontier AI development related queries" stuff but the point is more general...

I have criticized OAI many times for silent A/B testing, which I think is inappropriate for such a critical technology

Miles Brundage@Miles_Brundage

I tentatively think that silent model switching is never a good idea.

It's horrible for research (including safety research), among many other effects

12:09 PM · Jun 9, 2026 · 1.5K Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Posts from X

Most Activity

VIEWS1.3KLIKES13

Miles Brundage@Miles_Brundage

That doesn't mean Ant + others should just sit there and tolerate abuse.

There is a large action space, including throttling + issuing warnings, investigating the abuse, etc.

Miles Brundage@Miles_Brundage

Prompted by the Fable "nerfing on frontier AI development related queries" stuff but the point is more general...

I have criticized OAI many times for silent A/B testing, which I think is inappropriate for such a critical technology

5h1.3K130

REPLIES1

Miles Brundage@Miles_Brundage

@BlackHC @yong_zhengxin Not sure I follow what point you're trying to make. Sounded like you were defending the [model/system/whatever] switching thing, but now I am not sure

Andreas Kirsch 🇺🇦@BlackHC

@Miles_Brundage @yong_zhengxin I guess that's why Fable and Mythos are separate offerings because one can simply view Fable as the whole system (incl steering vectors etc)? Obv this won't allow valid inferences for Mythos

2h3900

Miles Brundage@Miles_Brundage

It also means you don't get a feedback signal on false positives - people can't complain if they don't know it's happening.

4h5478

Miles Brundage@Miles_Brundage

@BlackHC @yong_zhengxin One might choose to call it something other than model-switching (PET, steering vectors... sounds like effectively model switching to me, but anyway)... point is, it is a silent degradation

Andreas Kirsch 🇺🇦@BlackHC

@Miles_Brundage @yong_zhengxin It is not switching though. Still using Fable but sandbagging via prompt injection?

2h5920

Andreas Kirsch 🇺🇦@BlackHC

Miles Brundage@Miles_Brundage

2h3600