We are launching a new blog; http://Reliable-AI.Review.
First post is up: On the Impossibility of Mitigating AI Jailbreaks.
We are launching a new blog; http://Reliable-AI.Review.
First post is up: On the Impossibility of Mitigating AI Jailbreaks.
No Digg Deeper questions have been answered for this story yet.
We write about AI reliability from three angles:
- Technical: building secure, private, reliable AI systems. - Empirical: tracking benchmarks, progress, and failure modes. - Societal: how these systems shape, and are shaped by, human behavior.
We are launching a new blog; http://Reliable-AI.Review.
First post is up: On the Impossibility of Mitigating AI Jailbreaks.
Read it, and find our Discord, subscribe to our substack or RSS.
Writing about this too? We want contributors text us on discord.
We are trying to build community, NYC events coming soon.
We write about AI reliability from three angles:
- Technical: building secure, private, reliable AI systems. - Empirical: tracking benchmarks, progress, and failure modes. - Societal: how these systems shape, and are shaped by, human behavior.