Talk recording here: https://underline.io/lecture/144282-core-safety-values-for-provably-corrigible-agents
For those who can't make it:
Slides: https://anayebi.github.io/files/slides/AAAI26_MEW-11.pdf
Blogpost summary: https://www.lesswrong.com/posts/M5owRcacptnkxwD2u/from-barriers-to-alignment-to-the-first-formal-corrigibility-1