@deepfates @thkostolansky there will be individual instances of continually learning agents that become misaligned, the multipolar benefit is resilience through heterogeneity + institutions which favor alignment
society has evolved well to deal with this same dynamic in humans, we can do it again
@thkostolansky do you think the models we currently have or will have soon are not sufficiently capable of being dangerous?