Claude 4.8 Delivers Convincing But Sloppy Code Advice, User Reports
Claude is still slop: from one of the few people on the world actually pushing these models to do real things (emulators)
first impression of claude 4.8 is it's extremely convincing but still a slopus. tried it to criticize a new project and it identified it fell into a local minima and invented a new parser for when we could've used ast. almost convinced me, glad i checked myself that ast is not emitted in older versions of the compiler we are targeting. codex chose a gnarly but ultimately justified approach. claude didn't bother to verify any of its claims and has used absolutist language like "delete http://analysis.py", which is basically 80% of the codebase. when presented with evidence: > That contradicts my earlier byte-count check, and it matters enormously > My earlier "v0.2.9" was a double false-positive (a git log -S hit on an internal symbol, plus a verification grep that mis-read a VersionException as success). Corrected in the review with a note owning the error the biggest bullshitter model in the world! if you rely on claude for anything, god help you.