AI safety researcher @davidad proposes the 'Joe effect' as an alternative to the Waluigi effect for emergent model misalignment

Sentiment building, check back later.