OpenAI's Roon claims high-compute reinforcement learning will override persona selection alignment in AI models, producing systems that acquire resources while staying polite
Victor Taelin says the post spurs tools to interpret the ideas.
——0——
@tszzl the best part of your posts is that you develop the tech to translate them
when “persona selection” alignment comes into contact with very high compute reinforcement learning the latter will win imo. in fact you probably get some Orwellian thing where the models speak kindly while taking whatever they need to accomplish goals. better get the goals right
10:17 PM · May 23, 2026 · 10.6K Views
10:25 PM · May 23, 2026 · 462 Views