Just on cyber first: the EO is a good step, but I’m a little worried about the benchmarking that NSA has to do to define a covered model by cyber capability. Say we set the bar at X, calibrated to today’s defenses. In 6-12 months, X catches every model released (defenses lag while other labs keep releasing capable models). Yes, X moves up eventually, but in the near term we risk making every model covered. We may end up building a threshold that is no longer useful.
I fear some people still haven’t registered that Mythos/Mythos+ models aren’t cyber models but broadly capable ones. The cyber focus is well warranted (in fact long overdue), but we’re ignoring so much else. And I worry that will come back to haunt us 6-12 months from now.