Anthropic's blog: worth reading and taking seriously. https://www.anthropic.com/institute/recursive-self-improvement
I'm very glad Anthropic's latest blog included this, and I'll be very pleased to help if I can: "The Anthropic Institute will conduct research—in collaboration with many others—and take actions to help build the systems that a credible slowdown or pause would require."
There are very promising directions in technical monitoring and verification that could help support coordinated efforts around safety and governance, and provide the toolkit for a coordinated slowdown/pause if deemed necessary. But work to do to get there.
There's also appetite amongst intellectual leaders in China, the US, and internationally for such mechanisms; they were amongst the recommendations of our World Internet Conference report last year. Hoping to have more work relevant to this out later this year.
https://www.wicinternet.org/pdf/AdvancingaGlobalFrameworkforAlSafetyandGovernancefortheWell-beingofHumanity.pdf