/Tech33d ago

OpenAI, Thrive, and Crete deploy Codex-powered Tax AI, cutting preparation times by one-third with up to 97% accuracy

AI Judge changed title after evaluation, original title: "OpenAI's Boris Power builds a self-improving tax agent that reached 90% completion on nearly 60% of tasks"

The system increased tax firm throughput by 50%.

1242.3K981.1K569.2K

#32

Original post

Brandon McKinzie#1004

Samay@samaysham

At @ThriveHoldings, we built a product with @OpenAI to automate tax prep for the 30+ accounting firms we own across the country.

This season, it processed 7k+ returns. But what I think is more interesting is that the product meaningfully self-improved as accountants used it.

7:56 AM · May 27, 2026 · 331.2K Views

Sentiment

Many users praised OpenAI and Thrive's self-improving tax AI for delivering major time savings in tax prep and fitting verifiable domains well, while others flagged risks of error compounding and added regulatory overhead.

Pos

61.1%

Neg

38.9%

19 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

OPENAIVia

#381

Posts from X

Most Activity

VIEWS189.2KBOOKMARKS238LIKES837RETWEETS29REPLIES47

Greg Brockman@gdb

OpenAI for self-improving tax agents:

Samay@samaysham

At @ThriveHoldings, we built a product with @OpenAI to automate tax prep for the 30+ accounting firms we own across the country.

This season, it processed 7k+ returns. But what I think is more interesting is that the product meaningfully self-improved as accountants used it.

33d189.2K837238

Samay@samaysham

Read more about our work in detail with @OpenAI below.

https://openai.com/index/building-self-improving-tax-agents-with-codex/

33d4.3K3536

Samay@samaysham

The unlock was a self-improvement loop.

We record production misses: unsupported fields, wrong predictions, and corrections. Codex then uses that context to autonomously create evals from production data, hillclimb against them, and open candidate PRs for engineers to review. ⚡️

33d3.3K4029

Rohan Paul@rohanpaul_ai

OpenAI and Thrive just built a self-improving tax agent with up to 97% accuracy.

Tax AI processed 7,000 returns across 30+ accounting firms, saved about one-third of preparation time, reached up to 97% accuracy, and raised throughput by about 50%.

The hard part was not reading W-2s or 1099s, but handling messy K-1s, rental schedules, notes, spreadsheets, prior-year files, and values that must match across documents.

The system records the full trace: source file, extracted field, citation, tax-engine mapping, accountant correction, and final filed value.

Repeated corrections become eval targets, so Codex gets a narrow task with evidence, code, tests, and a pass condition.

A wrong tax field can come from many places: bad extraction, weak mapping, unsupported workflow, prior-year carryover, or human judgment.

The clever part was not simply using Codex to write fixes, but building a product environment where repeated practitioner corrections became bounded, testable engineering tasks.

In the rental-property example, the agent could inspect source documents, extraction traces, mapper behavior, expected outputs, and regression tests before proposing a change.

33d4.9K3922

Linus@thesephist

I really appreciate the lessons and technical ideas @samaysham & team were able to share about their tax agent system, which learns from production traces to self-improve via detailed tracing tightly integrated into deployment + an autonomous AI engineer.

Samay@samaysham

At @ThriveHoldings, we built a product with @OpenAI to automate tax prep for the 30+ accounting firms we own across the country.

This season, it processed 7k+ returns. But what I think is more interesting is that the product meaningfully self-improved as accountants used it.

33d6.3K3213

Boris Power@BorisMPower

Building evaluations and fast feedback and iteration loops is the most reliable way of solving hard problems and improving the product rapidly.

Here, the amazing Crete team with collaboration from OpenAI automated majority of the field extraction work, done during tax preparation within weeks!

33d3.7K499

Boris Power@BorisMPower

A glimpse of an exciting future where tax professionals will be able to spend more time advising customers and explaining the tax returns!

Samay@samaysham

At @ThriveHoldings, we built a product with @OpenAI to automate tax prep for the 30+ accounting firms we own across the country.

This season, it processed 7k+ returns. But what I think is more interesting is that the product meaningfully self-improved as accountants used it.

33d5.8K397

Boris Power@BorisMPower

Read the full blog on how this was achieved: https://openai.com/index/building-self-improving-tax-agents-with-codex/

Boris Power@BorisMPower

Building evaluations and fast feedback and iteration loops is the most reliable way of solving hard problems and improving the product rapidly.

Here, the amazing Crete team with collaboration from OpenAI automated majority of the field extraction work, done during tax preparation within weeks!

33d1K86

Samay@samaysham

One accountant told us she spent 180 hours on tax prep last year. This year she spent 15.

She used that time to call every client and walk them through their return.

That’s what gets me excited. Less time buried in prep and more time as a strategic advisor to clients.

33d2.6K331