Fable Benchmark Demonstrates Strong Calibration for AI Self-Assessment · Digg