Corrigibility Creates Paradoxes As AI Capabilities Outpace Instructions
——0——
Sentiment
Pos50%
Neg50%
Positive users support retaining corrigibility to preserve mental space and work within AI capabilities, while negative users question the ethics and second-order effects of creating highly corrigible minds.