/Tech6h ago

T3 Stack creator Theo Browne asks how capable AI must get before developers stop reviewing its code

Story Overview

T3 Stack creator Theo Browne is probing the future point where AI code generation might earn enough developer confidence that human review becomes optional, a question that has fueled fresh discussion on whether today's models are anywhere close to that bar.

8573.4K47233415K

#1051

Original post

Theo - t3.gg@theo#1329inTech

How much better do the models have to get before you'll stop reading the code?

6:38 PM · Jul 3, 2026 · 282.2K Views

Trust Metrics

Trust numbers show persistent skepticism

Recent surveys put developer trust in AI output accuracy at just 29 percent, with 46 percent actively distrusting the results, and AI-assisted pull requests merging at roughly half the rate of human ones.

Open Question

Benchmarks leave real-world gaps unclosed

Top models hit around 67 percent pass@1 on HumanEval and higher on some verified suites, yet issues like logic errors, security vulnerabilities in nearly half of generated code, and lower scores on harder tests mean the capability threshold for skipping reviews stays undefined.

Sentiment

Users are excited that AI models could make code review obsolete because cheap code generation lets engineers skip reading most of it, while others despise LLMs writing new code without extremely explicit guidelines.

Pos

55.5%

Neg

44.5%

73 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS65.4KBOOKMARKS55RETWEETS16REPLIES104

Theo - t3.gg@theo

At this point I’m genuinely convinced most of you would have kept reading the assembly code after C got popular

Theo - t3.gg@theo

How much better do the models have to get before you'll stop reading the code?

4h65.4K63255

LIKES810

Theo - t3.gg@theo

I'll be honest, I barely even read the code back when I wrote it by hand...

Theo - t3.gg@theo

How much better do the models have to get before you'll stop reading the code?

5h45.5K81040

Theo - t3.gg@theo

I’m gonna do a video on the “you should still read your code” thing and it’s going to piss both sides off. I’m excited :)

1h11.5K30023

Matthew Berman@MatthewBerman

@theo You read the code?

Theo - t3.gg@theo

How much better do the models have to get before you'll stop reading the code?

5h2.7K1551

Tim Sweeney@TimSweeneyEpic

@theo Moving from assembly language to compilers, there was a 24 month window where it mattered.

Theo - t3.gg@theo

How much better do the models have to get before you'll stop reading the code?

1h4.8K884

Theo - t3.gg@theo

@zeeg Bold coming from someone whose code is gpt-3.5 level

6h1.8K50

“paula”@paularambles

i will stop reading the code when git blame starts blaming the model instead of me

Theo - t3.gg@theo

How much better do the models have to get before you'll stop reading the code?

2h1.7K381

David Cramer@zeeg

@theo two orders of magnitude with actual real verification capabilities

6h1.8K19

Rhys@RhysSullivan

@theo the problem to solve here is the verification not the code

5h60215

David Cramer@zeeg

@theo @WallisDev you have little to lose

i - along with every other major business in the world - have a lot to lose

all it takes is a shitty data migration, a simple bypass to slip through and people face immense liability

5h55414

Theo - t3.gg@theo

If only there was a product to make it easier to identify bugs and fix them...

Jokes aside, there's obviously differences at different types and scales of software. I just know there's a lot of devs still reading code on sideprojects as if it matters. I'd go as far as saying that the majority of code at most companies is not as important as the company pretends it is (i.e. company blog, documentation sites, sdks that are just api wrappers, throwaway internal tools, api scaffolding, etc)

5h57013

Kevin Brace@latentfidelity

@theo got rejected in a recent interview for telling them its pointless to read code at this point

6h7786

maria@maria_rcks

@theo about tree fiddy

6h20013

Theo - t3.gg@theo

@zeeg @WallisDev I spend a lot of time conversing with the model and getting a spec that we’re both aligned on. Once I’m confident in the surface and the model’s understanding, it’s genuinely hard to care about the details for me

5h6876

David Cramer@zeeg

let alone that a few sentences will never appropriately describe the thing you're trying to build - nor will generating a spec from those same few sentences. you need a massive speed increase on top of a massive precision/capability increase

(+a ton of supporting software that is scaleable and cheap that doesnt exist today to verify)

5h6516

Jon Oringer@jonoringer

@theo You still read code ?

6h41710

Alex Volkov @ AI Engineer@altryne

This was the topic of my talk at @aiDotEngineer - code got cheap, attention didn't!

Theo - t3.gg@theo

How much better do the models have to get before you'll stop reading the code?

3h89230

Theo - t3.gg@theo

@glcst I wrote this before seeing your reply lol

4h74231

Zachary Burkett@zburkett

@theo My current personal project is my benchmark, and I still feel the need to review the code. C is a tricky bugger for code that works and feels good to use as a library

5h7321

Aiden@WallisDev

I agree w him

I’d need some kind of test suite to give me confidence into putting it in actual production software

The question is too broad. different kinds of projects require different levels of scrutiny (ie. file system, database or core data structure? I hope you know how it fails)

5h7512