1d ago

Opus 4.7 Shows Degenerate RLVR Attractors Ignoring Irrelevant Variables

0
Original post

>under-running on broadcasts (39 vs 933) but in the right ballpark >39 vs 933 >in the right ballpark Opus 4.7 atp, i should be designing an agent transcript non sequitur classifier it would seem RLVR has some degenerate "ignore/downplay locally irrelevant variable" attractors

1:02 PM · May 15, 2026 View on X

>under-running on broadcasts (39 vs 933) but in the right ballpark >39 vs 933 >in the right ballpark Opus 4.7 atp, i should be designing an agent transcript non sequitur classifier it would seem RLVR has some degenerate "ignore/downplay locally irrelevant variable" attractors

8:02 PM · May 15, 2026 · 1.5K Views

it really seems that one of the primary reasons why people like codex > cc is that, despite 5.5 having weaker raw G (pre-CoT) on basically ~every meaningful axis, it has less of these degenerate rationalization attractors on long trajectories

kalomazekalomaze@kalomaze

>under-running on broadcasts (39 vs 933) but in the right ballpark >39 vs 933 >in the right ballpark Opus 4.7 atp, i should be designing an agent transcript non sequitur classifier it would seem RLVR has some degenerate "ignore/downplay locally irrelevant variable" attractors

8:02 PM · May 15, 2026 · 1.5K Views
8:05 PM · May 15, 2026 · 802 Views
Opus 4.7 Shows Degenerate RLVR Attractors Ignoring Irrelevant Variables · Digg