1d ago

Opus 4.8 self-corrects on a trick spelling puzzle while DeepSeek V4-Flash fails to match the Pro version

The prompt asks models to count days containing the letter d

0
Original post

Opus 4.8 is insane, nothing will be the same after this model 💀

8:53 PM · May 28, 2026 View on X

Weird capability difference between DSV4-Flash and Pro They have identical tokenizers, largely identical data (32 vs 33T, and I think extra 1T for V4-Pro is from Flash traces), they reason the same. But Flash consistently insists that at least Friday has no 'd'.

IrushiIrushi@Im_IrushiK

Opus 4.8 is insane, nothing will be the same after this model 💀

3:53 AM · May 29, 2026 · 1.2M Views
1:47 AM · May 30, 2026 · 6.1K Views