A clarification: I'm not against open-source AI at all, yet I do think regulation will be necessary as open-source capabilities approach those of frontier models such as Mythos and beyond (sorry!)
Open-source models have a valuable place to democratize access to automation and create value for everyone
But that doesn't mean that I'm not concerned about abuse and related severe risks, or that I'm naive about the alternatives to regulating these models: more general surveillance and monitoring - see eg the recent open letter calling for regulating/tracking synthetic DNA. (If open and local models that are available to everyone aren't safe and can't easily be monitored, monitoring and surveillance will move to a different layer in society)
There is also nice existential risk argument in *favor* of open-source models: they can slow down the frontier by squeezing the margins and reducing the willingness for additional $$$ research compute. If open-source models are good enough for value creation in sufficiently many use cases, they will become favored over closed models. This might slow down frontier research by itself slightly and give us a needed breather
Medium term to prepare for regulation, we should make sure that open-source models are not trained on data that is related to risk areas. If a model can still reason it's way to risks using a ton of test-time compute/token and experiments, that is a trade-off and might well be okay if it takes long enough and is expensive enough
Interesting research questions are about creating filtered datasets and researching on how to make sure that models cannot be abliterated and don't have plasticity in areas we don't want to (eg if the model bad at biorisiks, it should stay that way even if we try to pre-train it some more or fine-tune it). Can we do that?
(As always: my own opinion and not on behalf of GDM or G...)






