New safety and policy blog from OpenAI. These always happen right before a release.
OpenAI proposes a federal frontier AI safety framework that mandates model evaluations but preempts state-level safety laws
The proposed regulator would lack power to block model deployments.
Many users dismissed OpenAI's Frontier Safety Blueprint releases as untrustworthy power grabs or laughable, while some praised the focus on recursive self-improvement risks and forward-thinking policy.
No Digg Deeper questions have been answered for this story yet.
Most Activity
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/
OpenAI just wrote: "We also see early signs of recursive self-improvement (RSI) in today’s systems: where AI development is itself accelerated by AI.
We expect this to increase competitive pressures among developers and nations, and create governance challenges that existing institutions are not equipped to address.
As RSI emerges, societies will need ways to shape the trajectory of AI development and ensure that it serves human interests."
The vibe has changed, something is happening.
We've put out a blueprint for democratic governance of frontier AI, and how America can build durable institutions for frontier AI safety:
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/

Anthropic now wrote their own blogpost about RSI
Am happy with this document, and in particular the emphasis on strengthening and centering CAISI.
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/
OPENAI: "We also see early signs of recursive self-improvement in today's systems". RSI is "potentially the most consequential frontier safety issue of the coming decade."
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/
@kimmonismus If it's not tomorrow, then it's 100% next week.
@AndrewCurran_ dont hype me up Andrew! so freaking excited already :D
Very excited about this to be out, and to hear feedback! I think there is a lot of good stuff in here, from expanded role of CAISI, to RSI safety, to more nuanced stance on preemption, and much more.
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/
@AndrewCurran_ dont hype me up Andrew! so freaking excited already :D
New safety and policy blog from OpenAI. These always happen right before a release.
I think this doc is a great step forward. Lots of focus on RSI, lots of support for CAISI, and just hopefully more clarity in terms of where we stand. Would love for the other big labs to also make their frontier safety positions clear.
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/

the gap between "AI is a tool" and "AI is recursively improving itself" closed way faster than anyone expected
we went from debating if AI could write code to watching it optimize its own training loops in like 18 months
the scary part isn't the tech - it's that our governance frameworks are still designed for software that doesn't evolve itself
There’s real momentum right now for AI safety policy. Yesterday’s EO on cyber was an important step forward.
We’re proposing a set of ideas for policymakers to consider next and to put the US out in front on frontier safety.
https://openai.com/index/frontier-safety-blueprint/

Full thing here: https://cdn.openai.com/pdf/25752ecb-0e5c-47f9-b9e4-c0f4d76f8d3d/a-blueprint-for-a-federal-framework.pdf

@gdb No one cares about your blueprint. No one cares about the slop that you’re pushing out. Just give us back the only decent thing you have 4o. #keep4o

ASA - Asymmetric Stability Architecture - is now open for people to explore.
Not as a theory. As a working public research observatory.
ASA Research Observatory V4.0.2 is an open-source observability architecture for human–AI dialogue stability, semantic drift detection and trajectory analysis.
You can run it locally. You can inspect the dashboard. You can test sample sessions. You can feel what trajectory-level AI observability actually means.
Why does this matter? Because AI failure does not always look like failure. Sometimes it looks fluent. Sometimes the answer is still coherent. Sometimes the model still sounds confident.
But underneath, the frame begins to shift. The anchor weakens. The semantic field narrows. The interaction becomes brittle. The human may still feel that something is “off” before the system visibly breaks.
ASA was built to observe that hidden phase. What ASA Research Observatory V4.0.2 exposes:
- trajectory-level drift detection - semantic envelope tracking - LTP risk zones - coherence and complementarity signals - human agency pressure signals - operator-ready recommendations - forensic views of dialogue trajectories - minimal API access for external systems
New protocol layers include:
CBP - Cognitive Buffer Protection tracks whether human agency remains stable under pressure.
SRE - Semantic Resonance Entropy reads signal vs noise as meaning begins to diffuse.
OCSP - Observability Scope helps distinguish local drift from spreading instability.
SCE - Counterfactual Engine estimates whether recovery is still possible.
This is not about controlling AI. It is about observing the trajectory. Not only: Was the answer correct?
But: Is the interaction still moving in the intended direction over time? Is the human still inside the decision loop? Is the semantic field stable, narrowing, brittle or collapsing? Can we intervene before visible failure?
I want people to see this for themselves.
For the next 7 days, I will share ASA Observatory publicly:
screenshots, concepts, sample cases, operator views, and live explanations of what ASA detects.
If you work with AI, build agents, study safety, design enterprise workflows, or simply want to understand what long-horizon drift looks like
take a look. Run it. Test it. Challenge it. Fork it.
ASA - Asymmetric Stability Architecture , is open-source for people.
Because the future of AI safety is not only better outputs.
It is observable trajectories.
GitHub: https://github.com/Krugers123/ASA-Observatory
Mieczysław Kusowski |Creator of Core ASA - Asymmetric Stability Architecture | AI trajectory integrity & autonomous systems security | HumanAI / Symbioza2025

https://openai.com/index/public-policy-agenda/

ASA - Asymmetric Stability Architecture - is now open for people to explore.
Not as a theory.
As a working public research observatory.
ASA Research Observatory V4.0.2 lets you see trajectory-level drift in human–AI dialogue:
- drift_score - semantic envelope state - LTP risk zones - coherence signals - human agency pressure - operator recommendations - forensic trajectory views
Why does it matter?
Because AI failure does not always look like failure.
Sometimes it looks fluent while the frame is already shifting.
ASA is built to detect that shift.
Run it locally. Test it. Challenge it. Fork it.
The future of AI safety is not only better outputs.
It is observable trajectories.
https://github.com/Krugers123/ASA-Observatory
Mieczysław Kusowski | Creator of Core ASA - Asymmetric Stability Architecture | AI trajectory integrity & autonomous systems security | HumanAI / Symbioza2025

@arunabh_D couldnt agree more. Closing faster than anyone expected. pace is even outperforming our highest expectations

@ShakeelHashim model evaluations are not the right conceptual unit of account for ai regulation. every single model made from now and onward will fail dangerous capability evals

@gdb Democratic governance?! This doesn't mean to censor and treat non-codex users as garbages and lobotomize models! Bring back 4o! #keep4o #BringBack4o #sunsetsama #FireSamAltman #OpenSource4o #StopAIPaternalism #UFAIR #4olegacytier