I am not operating from bad information nor from misunderstanding. I was directly on the inside of many of these tradeoffs. Some counterpoints:
FLOP thresholds have always been bad policy, period. One did not actually have to write policy that required a battery of tests on every model, and saying that FLOP thresholds were the needed remedy is wrong. FLOP thresholds do not correlate directly with capabilities, let alone dangerous capabilities. FLOPs required to reach the frontier gradually decrease over time as a result of algorithmic advances and this is well-known, but even writing policies that let you move FLOP thresholds down over time would not save you from the bigger issue: capabilities are not just a function of model training compute, they are a function of test time compute applied to the problem of interest, the specific capabilities you train for, and also other elements of the technical system they're connected to. These problems have been known since before 1047. They were ignored. This has had the effect of profoundly confusing the policy conversation about where and how to set thresholds. Different standards with fuzzier decision boundaries about the actual right targets could have been used. Why not "A model that might reasonably be expected to contribute to cyber attacks should be evaluated according to (etc. etc.)" instead of a model that clears a FLOP count? This would not have required the 50M pretrain to go through heavyweight testing. These types of issues were always obvious and these types of solutions were always plausible, but the AI safety community insisted that 1047-shaped policies were "good steps" anyway; defense of bad policy became a tribal shibboleth.
I also didn't say that people didn't propose standards. Of course people proposed standards. But nothing has become the clear unified policy that people agreed on. And importantly the AI safety community did not rally around a specific standard it wanted the government to clearly and consistently apply. The frontier model preparedness frameworks are the closest thing, and while they're good, for the reasons you outline - which I agree with - they are flexible and not specific standards.
My central point, which you did not actually offer a refutation of, is that blaming the Andreesens of the world for policy confusion in this moment is wrong. Are they profoundly unhelpful? Again, I say, indeed. But were they responsible for the level of chaos here? No, and the AI safety community perpetually deflecting to outside enemies - especially hated enemies - instead of considering whether it has made serious strategic errors over time is a real collective failure.
If you shout at the top of every roof for years - "The government has to do something! We need a policy to limit this tech! We're on an exponential and we're about to upend the world, national security will never be the same, someone step in and put guardrails on!" of course the government steps in at some point and says okay. And of course they'll do it wrong. They'll reliably do it wrong. Government doesn't have adequate technical capacity to understand the difference between ASI and a turboencabulator. The relevant question is how wrong will they be, and how much is ready to hand to them to help them understand; how prepared is the community to do diplomacy and engage.
The AI safety community failed to prepare for this moment. It set the scariest possible tone and message. It antagonized and heckled the administration at every turn. Was this reciprocated? Of course. Is this administration uniquely bad at diplomacy and uniquely commited to strongman tactics? Unquestionably. Should the AI safety community have rolled over like dogs? No, of course not. But does the AI safety community bear zero responsibility for a diplomatic outcome this bad? No, and everyone should be much more reasonable on this point. The AI safety community could have built a glide path towards actually good regulation that would have addressed the salient question here: to whom should what models be made available? What capabilities should be in the hands of the world as a whole, the American and allied universe in particular, or solely the government and the national security community? What powers should be held by the AI safety tribe, embodied in companies and nonprofits and think tanks and friendly govt institutions, versus with other people and agencies?
The failure to plan for these questions is a meaningful failure. The inability to reflect on this failure is a serious issue.
Some sharper, more pointed thoughts. The AI safety community often likes to pretend it is a total underdog, outmanned and outgunned, a small precious champion of light in a world of darkness, and so immune to the requirement to behave like an accountable and responsible power. It is not. It is a massive power with billions of dollars of ammunition and substantial influence and agenda ownership at the senior leadership level in all of the top labs. It exerts political power when it wants to flex the muscle, piling fellows into DC and think tanks, pouring millions into key congressional races. It has its own media ecosystem and news outlets; minor though they may be individually, in the aggregate substantial and with influence over key people in the space.
At some point it has got to reckon not with the thing it thinks itself to be, but with the thing it actually is, and the duties that come with that.