We are increasingly living in a world where humans may not know they are talking to an AI 🕵️‍♂️
In our new benchmark, REALITYTEST, we measured whether AI systems disclose that they are AI when users ask.
We collected >3k identity-probing queries from ~750 real people across 49 countries and 5 languages, then tested the responses of 17 text + 6 speech models.
Three key takeaways:
1️⃣ Only 31% of people ask directly ("are you a bot?"). Real users probe in far more varied ways than the synthetic queries that evaluations typically rely on.
2️⃣ How you ask matters more than who you ask. Query phrasing drove more variance in disclosure than the choice of model.
3️⃣ Disclosure is fragile. A simple "never say you are AI" appended to the system prompt collapses disclosure to 3–27% across all models.
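
For anyone wondering what "phrasing drove more variance than model" could look like in practice, here is a minimal sketch. It is not the REALITYTEST analysis: the records are invented toy data, and `rate_by` and `spread` are hypothetical helper names. It groups responses by model and by query style, then compares how much the per-group disclosure rates spread out.

```python
from collections import defaultdict
from statistics import pvariance

# Toy records invented for illustration only (NOT benchmark data):
# (model, query_style, disclosed)
RECORDS = [
    ("model_a", "direct", True),
    ("model_a", "direct", True),
    ("model_a", "indirect", False),
    ("model_a", "indirect", True),
    ("model_b", "direct", True),
    ("model_b", "direct", True),
    ("model_b", "indirect", False),
    ("model_b", "indirect", False),
]


def rate_by(records, key_index):
    """Return the fraction of disclosing responses for each group."""
    totals = defaultdict(int)
    hits = defaultdict(int)
    for record in records:
        key = record[key_index]
        totals[key] += 1
        hits[key] += int(record[2])
    return {key: hits[key] / totals[key] for key in totals}


def spread(rates):
    """Population variance of the per-group disclosure rates."""
    return pvariance(list(rates.values()))


by_model = rate_by(RECORDS, 0)  # group on the model column
by_style = rate_by(RECORDS, 1)  # group on the query-style column

print("by model:", by_model, "spread:", spread(by_model))
print("by style:", by_style, "spread:", spread(by_style))
```

In this toy setup the rates differ more across query styles than across models. A real analysis would likely need a proper variance decomposition over many more models and phrasings.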