ArkSim Open-Sources Framework to Detect Agent Drift in AI Systems
Try ArkSim: https://github.com/arklexai/arksim
Agent drift is what happens when your AI agent passes QA but fails in production. Your prompts are locked down. Your evals pass. Everything looks stable. Then a real user starts pushing the conversation: ▪️the agent gives legal advice ▪️caves after repeated refusals ▪️confidently invents answers These failures usually don’t appear in unit tests. They emerge over long, adversarial, multi-turn conversations. That’s why we built ArkSim, an open-source framework for simulating realistic users against AI agents at scale and evaluating every turn for helpfulness, faithfulness, coherence, goal completion, and more. Read the full article featured on All Things Open: https://allthingsopen.org/articles/agent-drift-open-source-arksim-ai-agent-testing