VIEWS913BOOKMARKS2LIKES29RETWEETS1REPLIES1

Tenobrus@tenobrus
similarly, seems like a case of the model censoring its reasoning.
3hViews 913Likes 29Bookmarks 2
The model exhibited no actual sabotage or evaluator deception.

similarly, seems like a case of the model censoring its reasoning.
@tenobrus me when my black box oracle machine does unexpected things in oracular language

@tenobrus this is a long time coming

@tenobrus It's ready to work in customer service