Yoav Gur Arieh finds most unfaithful chain-of-thought detectors perform near random chance · Digg