Basically, the CIs overlap, it is not nerfed, so far looks like it’s the same model.
The community has been asking how Claude Fable 5 compares before vs. after its latest re-deployment.
We collected thousands of votes on the new endpoint across Arenas - Text, Vision, Document, Code, and Agent - and here’s an early score preview.
So far, scores look mostly consistent before and after re-deployment. Fable 5 remains at the frontier across Text, Document, Vision, and Code Arena: Frontend. The ~20-point drop in Frontend is still within the confidence interval as scores continue to stabilize.
We’ll share more insights as more data comes in across all arenas - stay tuned!