/AI2d ago

Builder Redirects Codex Credits to GPT-5.5 Agents Optimizing HVM5

3491828283108.8K
Original post
Taelin@VictorTaelin#1032inAI

Since my 200x Codex credits end tomorrow, I've redirected the Mac Mini cluster to optimize HVM5. Depending on the time it ends, I should have 6h to 30h of dozens of GPT 5.5 agents working on it. I've wrote the most careful anti-reward-hack prompt ever. Let's see how it goes!

6:36 PM · Jun 4, 2026 · 32K Views
Sentiment

Positive users praise the GPT-5.5 agents' 2x HVM5 speedups and trustworthy setup while negative users lament the modest gains, bigger files, and extra spending.

Pos
63.5%
Neg
36.5%
19 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS108.8KBOOKMARKS283LIKES918RETWEETS28REPLIES28
Taelin@VictorTaelin

This has been a massive success!

So I left 32x GPT 5.5 agents (enough to fill the 5h limit) on 32 separate machines. Each one received HVM5's unoptimized file, and a prompt demanding for a 10x speedup. After 4 rounds, all agents reached ~2x speedups, with 1.3x to 2x increases in file sizes.

Many cool ideas surfaced. Some are obvious and most agents rediscovered to them, like computed gotos. But others are actually surprising. For example, turns out a bounded LIFO freelist is MUCH faster than the usual algorithm. Many agents didn't bother trying. Neither did I!

Now I'm in the process of mining insights from the 32 runs so I can merge them into a "super HVM5". Not sure what is the best way to do this, and my 200x credits are about to expire, but I still have a few hours, so I have one shot at getting it right. Sounds fun!

(Also thanks for the free credits, I appreciate it a lot...)

Taelin@VictorTaelin

Since my 200x Codex credits end tomorrow, I've redirected the Mac Mini cluster to optimize HVM5. Depending on the time it ends, I should have 6h to 30h of dozens of GPT 5.5 agents working on it. I've wrote the most careful anti-reward-hack prompt ever. Let's see how it goes!

1dViews 108.8KLikes 918Bookmarks 283
Taelin@VictorTaelin

@CozendeyMath https://gist.github.com/VictorTaelin/5e64c85e4b1b238a8a2cdedff4b40afe

note this is a completely unoptimized reference implementation

I don't want to bias them towards any direction to maximize the chance they have some idea that is worth of keeping

2dViews 3KLikes 51Bookmarks 77
Sterling Crispin 🕊️@sterlingcrispin

@VictorTaelin @TheRealAdamG You should check out Google co-scientist paper it’s aimed at solving essentially this problem, given a population of hypothesis how do you define and rank them etc, Swiss tournament, the ideas get ELO rankings, it’s pretty cool

1dViews 993Likes 33Bookmarks 15
StoicYield@StoicYield

@VictorTaelin

1dViews 456Likes 48Bookmarks 1
Rainstar@mazasiel

@VictorTaelin build atomic task completion. have one model review another model's work. have another model review the first model's reasoning. if both reasoning and code reviews say "it doesnt look like they reward hacked" then the work proceeds, otherwise, it gets thrown away

2dViews 1KLikes 7Bookmarks 4
mathews@CozendeyMath

@VictorTaelin share the prompt? I’m curious

2dViews 2.8KLikes 2Bookmarks 2
Taelin@VictorTaelin

@gbrlvv I'm always working

2dViews 2.2KLikes 19
Youssof Altoukhi@Youssofal_

@VictorTaelin Oh, don’t underestimate its ability to cheat.

2dViews 1.1KLikes 10Bookmarks 1

@VictorTaelin Mr. Taelin what are you doing working, I mean tweeting on a holiday night. Have you not-

2dViews 2.8KLikes 9
Sauers@Sauers_

@VictorTaelin hahahaha

2dViews 1.4KLikes 13
Taelin@VictorTaelin

@extliqprovider I believe it would do much better but I'm not rich

1dViews 912Likes 8
Taelin@VictorTaelin

@sterlingcrispin @TheRealAdamG tyy

1dViews 840Likes 3
Lucca Huguet@luccahuguet

@VictorTaelin @CozendeyMath Hmm you really put in the effort

2dViews 73Bookmarks 1

@VictorTaelin Cursor literally did this to optimize GPU kernels

1dViews 674Likes 5
Taelin@VictorTaelin

@RayLin_AI @TheRealAdamG oh no :(

1dViews 560Likes 1
efe@extliqprovider

@VictorTaelin can opus 4.8 do the same?

1dViews 986
RayLin🐡@RayLin_AI

@VictorTaelin @TheRealAdamG 200x already ended

1dViews 656
JMoon@Jmoon_174

@bettercallsalva @VictorTaelin Hybrid makes sense: cheap local hardware for the 80% of tasks that don't need frontier models, one sub for the work that does. Credits ending just accelerates a decision most people should've made anyway.

1dViews 5Bookmarks 1
Ceoz@Ceoz_1

@VictorTaelin everything into 5.5 Pro, 2-3 years and forget.

1dViews 417Likes 4
Thiago Salvador@bettercallsalva

@VictorTaelin the move back to your own mac cluster once the credits end keeps repeating. when the subsidized tier dries up, the people who understand their workload route it to hardware they control. renting made sense at zero cost, less so when the bill is real.

1dViews 83Likes 1
Load more posts