1d ago
Chenhao Tan and FAR.AI launch MechEvalAgent to detect implicit hallucinations in mechanistic interpretability research
It flags when an agent's research claims contradict its code.