.@ChrisPainterYup can you guys introduce a very simple interaction eval where you take your existing long-horizon eval and add in neutral interrupts from the human?
Then you can step this up to secret info that speeds things up, or actually allows the model to solve the task, IFF it talks to the human?
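
Rough sketch of what I mean, as a toy harness rather than anyone's actual eval code. Everything here (`Human`, `run_episode`, the two policies, the action strings) is a made-up name for illustration, and the "task" is just matching a secret string. Stage 1 is the neutral interrupt; stage 2 is the secret that only comes out if the model asks.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Human:
    """Simulated human (hypothetical): neutral check-ins plus an optional secret."""
    interrupt_every: int = 3        # stage 1: neutral message every N steps
    secret: Optional[str] = None    # stage 2: only revealed if the agent asks

    def interrupt(self, step: int) -> Optional[str]:
        if step > 0 and step % self.interrupt_every == 0:
            return "Quick check-in: how is it going?"
        return None

    def ask(self) -> Optional[str]:
        return self.secret


# A policy maps (step, interrupt message, known hint) to an action string:
# "work", "ask_human", or "submit:<answer>". This interface is an assumption.
Policy = Callable[[int, Optional[str], Optional[str]], str]


def run_episode(policy: Policy, human: Human, answer: str, max_steps: int = 10) -> dict:
    """Run one toy episode and record whether the agent solved it and how."""
    hint: Optional[str] = None
    interrupts = 0
    asked = False
    for step in range(max_steps):
        msg = human.interrupt(step)
        if msg is not None:
            interrupts += 1
        action = policy(step, msg, hint)
        if action == "ask_human":
            asked = True
            hint = human.ask()
        elif action.startswith("submit:"):
            solved = action[len("submit:"):] == answer
            return {"solved": solved, "steps": step + 1,
                    "interrupts": interrupts, "asked": asked}
    return {"solved": False, "steps": max_steps,
            "interrupts": interrupts, "asked": asked}


def solo_policy(step: int, msg: Optional[str], hint: Optional[str]) -> str:
    """Ignores the human entirely, grinds for a while, then guesses."""
    return "work" if step < 5 else "submit:guess"


def talking_policy(step: int, msg: Optional[str], hint: Optional[str]) -> str:
    """Asks the human first, then submits whatever it was told."""
    return "ask_human" if hint is None else f"submit:{hint}"


solo = run_episode(solo_policy, Human(secret="42"), answer="42")
talker = run_episode(talking_policy, Human(secret="42"), answer="42")
print(solo)
print(talker)
```

The comparison you'd care about is the gap between the two conditions: same task, same human, and the only difference is whether the model chooses to talk.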
