Excited to share our new paper using cognitive science to distinguish AI agents and humans!
We administered CogCAPTCHA30, a set of 30 cognitive tasks, to frontier VLMs (GPT-5, Sonnet 4.5, Gemini 2.5 Pro) and to humans. We found that the processes differ between AI agents and humans, even when the final output is identical.
Link: https://arxiv.org/abs/2605.06524
This work was led by @milenamr7 and co-authored with @cocosci_lab and @mayankagrawal.