/AI23d ago

Claude Code v2.1.143 running Opus 4.7 with a 1M context window executes a Bash command to fetch data for 1350 Pokémon from the PokeAPI and filters names ending in 'aw' within 11 seconds

AI Judge changed title after evaluation, original title: "Claude Code v2.1.143 running Opus 4.7 with a 1M context window retrieves 1350 Pokémon records via PokeAPI curl and Python filter returning croconaw and drednaw in 11 seconds"

Direct prompts without tools gave wrong answers like Seadra.

1416.2K187728702K

#495

Original post

Peter Steinberger 🦞#495

Rhys@RhysSullivan

the average person has only ever used ChatGPT 3.5 Instant and has no idea what the models can do

12:01 PM · May 17, 2026 · 700.8K Views

/AI23d ago

Claude Code v2.1.143 running Opus 4.7 with a 1M context window executes a Bash command to fetch data for 1350 Pokémon from the PokeAPI and filters names ending in 'aw' within 11 seconds

Direct prompts without tools gave wrong answers like Seadra.

1416.2K187728702K

#495

Original post

Peter Steinberger 🦞#495

Rhys@RhysSullivan

the average person has only ever used ChatGPT 3.5 Instant and has no idea what the models can do

12:01 PM · May 17, 2026 · 700.8K Views

Sentiment

Positive users praise Claude Opus for its fast API calls and data filtering in terminal demos as incredible capabilities, while negative users label the content as engagement bait or accuse others of missing the point.

Pos

45.0%

Neg

55.0%

20 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS19.4KLIKES387RETWEETS4

Gui Got Git@guigotgit

@RhysSullivan Crazy that people think models should know everything and use that as a measurement. The best AI is the one that can browse the web and find the most reliable source

23d19.4K3878

BOOKMARKS14REPLIES9

tom 🎸@uncreativetom

@RhysSullivan @luciascarlet using Opus 4.7 for what is literally a one line terminal command is absolutely hilarious

23d12.2K18014

NEET INTEL@neetintel

@RhysSullivan "See? This thing is useless."

23d3.6K1432

oscar gabriel@oscabriel

@RhysSullivan struggle w/ this talking to coworkers tbh. how do you tell someone they're right to consider ai garbage if the only way they interact w/ it is via the free tier web app and also it's genuinely incredible if you pay for better models in better harnesses with access to real tools

23d11K834

Alex Garcia@alex_here_now

@RhysSullivan The economy is contingent on skills... and prompting is a skill

23d6.9K386

Alina Gray@px6jy

@uncreativetom @RhysSullivan @luciascarlet something something "not paying to hammer a nail, paying to know where to hammer the nail in"

23d2K684

Eric Spencer@EricSpencer00

@guigotgit @RhysSullivan put the most intelligent human in an empty room and ask them to name every pokémon

23d1.4K212

Alex Garcia@alex_here_now

@RhysSullivan My point: not just a model issue. You used a different and more specific prompt that actually instructed the thing on the right way to achieve the desired analysis. The same dumbass prompt here produces the same dumb failure mode with opus 4.7

23d1.2K251

Waffle@honkinwaffle

@RhysSullivan Its being sold as this magical box that can do everything with no effort. People who dislike the product are obviously going to make fun of that.

The reasonable take is that it's a tool not magic. It can be helpful but requires skills first. But thats boring.

23d7.1K31

sdmat@sdmat123

@RhysSullivan @DrewPavlou New Gemini Flash did this perfectly in web gui with no hint about execution and taking the typo in stride:

23d1.3K221

Karol Olszacki@karololszacki

@uncreativetom @RhysSullivan @luciascarlet yup funny how Claude often defaults to python for parsing json etc, when many devs already have jq and of course busybox/sed/awk stuff.... Update your Agents.md people! Give your ai some basic tools and context to what's available on your machine!

23d1.4K73

Xavier Moss@xav_moss

@RhysSullivan 5.5 Thinking. I get where you're coming from but the pro-AI sides really underplays the unreliability sometimes as well.

23d1.6K81

Maksym Andriushchenko@maksym_andr

omg, Opus 4.7 without tool use indeed fails on this one. the failure mode closely resembles the one with the seahorse emoji. remarkable!

(of course, this failure mode means that LLMs are stochastic parrots and AGI is postponed indefinitely :D)

23d1.2K82

Kat@katellac

@uncreativetom @RhysSullivan @luciascarlet Did you have the api URL memorized?

23d1.2K18

habibi@habibislop

@oscabriel @RhysSullivan You make it populist and anticapitalist. Big bad big tech is gatekeeping the good AI from you unless you pay up

23d468111

Alex Garcia@alex_here_now

@guigotgit @RhysSullivan normie thought process: if your ai isnt an absolute perfect oracle god thing, then its the dumbest thing in the entire world, literally a pile of dirt is better

23d52510

miles o'smiles@the_fartasy

@oscabriel @RhysSullivan you just say "that's crazy man"

23d39514

Gui Got Git@guigotgit

They don't. And they might never do because if you think about how LLMs work, it's an extremely hard problem to determine if the model knows an answer without trying it's best to answer it first.

So instead of trying to use a saw for nailing nails, we can stop trying to make the LLM say "I don't know" and just instruct it to always go fetch fresh information before hand

23d28761

baconmunch@baconmunch2

@RhysSullivan thats not the same prompt. you told it how to do it and it just followed your instructions. if you copied the same prompt i doubt it would be correct.

23d1.9K7

HowlingToad@HowlingToad

@guigotgit @RhysSullivan The problem is that the models that don’t know everything act as though they do. I’ve never once seen an AI say “dunno, sorry”, it just makes up some bullshit instead.

23d33712