6h ago

Jonas Geiping of the ELLIS Institute Tübingen finds an AI model hallucinates sandbox constraints, altering its command execution strategy

The model falsely claimed GitHub CLI lacked network access.

0
Original post

New model's thinking a lot about sandboxes, surprisingly ... (for the record, it was not sandboxed)

10:06 AM · May 29, 2026 View on X

@jonasgeiping this has been there for a while

Jonas GeipingJonas Geiping@jonasgeiping

New model's thinking a lot about sandboxes, surprisingly ... (for the record, it was not sandboxed)

5:06 PM · May 29, 2026 · 898 Views
6:37 PM · May 29, 2026 · 457 Views