🚨 Today's agents automate tasks for us using access to our emails, files, and memory. We found that, 23–41% of the time, even agents backed by frontier models disclose private information that shouldn't be shared. The fix isn't more automation. 1/n
Agents can use tools to gather information, remember context, and act on your behalf.
That makes them useful. It also makes them dangerous. Agents can leak information they shouldn’t.
Introducing PrivacyAlign! PrivacyAlign uses human-annotation-grounded training and evaluation to cut privacy leaks by up to half and make automated privacy evaluation for agents more reliable.
Project Page: https://privacyalign.github.io/
🧵(1/10)


