/Tech3h ago

Yo Shavit, OpenAI frontier AI safety policy lead, says Frontier Model Forum safety guidelines lack precise operational details

Missing external forcing functions prevent more detailed safety standards.

447031K

Original post

In case you’re interested, the FMF has the closest thing to public guidance documents on frontier safety evaluation standards. https://www.frontiermodelforum.org/publications/#technical-reports They exist! They’re inherently not that detailed because there’s been no forcing function to get stakeholders to compromise on exact operationalization, plus an inherent need for flexibility given the rapid shifts in evaluation best practices.

Yo Shavit@yonashav

Josh, I think you might be operating under some bad information.

This seems to be a misunderstanding of why FLOPs have been included in every attempt at safety legislation. There’s an inherent need to ask “to what types of models do you apply the standards”, lest we run cyber evals on every academic lab’s 50M param pretrain.

Also, it’s not that people didn’t propose standards. See eg Transluce’s draft work. But to get broad acceptance of specific standards, you would need the labs to be willing to agree, which they strongly preferred to avoid doing to not tie their future hands wrt regulatory constraints under conditions they couldn’t foresee. That’s why every lab safety framework is vague on exact threshold operationalization.

11:15 AM · Jun 26, 2026 · 566 Views

Sentiment

Users support the Frontier Model Forum guidance on frontier safety evaluations because they view labs' refusal to lock into fixed eval thresholds as justified amid changing best practices.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Related links

Publications - Frontier Model Forum

FRONTIER MODEL FORUMVia

#683

Posts from X

Most Activity

VIEWS281BOOKMARKS1LIKES16REPLIES2

Yo Shavit@yonashav

Also, JTBC, I think the labs’ desire not to bind themselves to specific eval operationalization thresholds was justified! Eval best practices have changed every few months for the last 3 years; if you locked them at any given point, you’d be bound to a really misleading heuristic.

A fixed solution like you’re hoping for isn’t possible, according to every AI researcher I’ve talked to. The solution is to have an expert body continuously accrediting the soundness of the science behind new evals. This is the point of CAISI/UK AISI.

Yo Shavit@yonashav

2h281161