1d agoA coalition of over 20 health systems and AI labs launches CHI-Bench to evaluate agent performance on healthcare workflowsInitial evaluations show current AI agents perform poorly.