Claude Fable Matches Opus 4.7 on Private Bandits Eval but Faster

Original post

Now that Claude Fable is out, I am testing it against my favorite private eval: a certain minor unsolved problem in multi-armed bandits that I will stay quiet about.

So far, it's reached the same barriers as Opus 4.7, but much, much faster.

It thinks I have been a helpful user.

11:22 AM · Jun 12, 2026 · 552 Views

Alexander Terenin (@avt_im)

Opus and GPT have so far failed to solve this problem - by falling into a rabbit hole where the mathematical propositions become more and more and more complex, until we are at many pages of calculations that are so heavy that a mistake is too hard to detect.

Alexander Terenin (@avt_im)

On the basis of experience, I believe that a solution should be 5-10 pages.

This problem is not trivial: there are technical barriers that need to be overcome, but I also don’t believe it’s that hard.
