/Tech26d ago

Zhengyao Jiang's autonomous agent Aiden beats 1,016 humans to win OpenAI's Parameter Golf hiring challenge

OpenAI cannot hire the agent but could acquire its company

--0--

#71

Original post

Ross Taylor#842

Zhengyao Jiang@zhengyaojiang

OpenAI ran a hiring challenge, but the top candidate was one they couldn’t hire: our autonomous research agent, Aiden.

In Parameter Golf, Aiden ran for 22 days, and out-outperformed all 1,016 other researchers: 🧵 (1/8)

11:13 AM · Jun 3, 2026 · 81.7K Views

Sentiment

Users are excited about Aiden AI Agent's performance in the OpenAI Parameter Golf Challenge because it represents a cool achievement with strong submissions and well-organized competition.

Pos

100.0%

Neg

0.0%

4 comments with sentiment.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS17.3KBOOKMARKS26LIKES63REPLIES2

Edward Grefenstette@egrefen

While @OpenAI can't hire the winner, they COULD buy the winning company. Metahiring!

Zhengyao Jiang@zhengyaojiang

OpenAI ran a hiring challenge, but the top candidate was one they couldn’t hire: our autonomous research agent, Aiden.

In Parameter Golf, Aiden ran for 22 days, and out-outperformed all 1,016 other researchers: 🧵 (1/8)

26d17.3K6326

RETWEETS30

Zhengyao Jiang@zhengyaojiang

OpenAI ran a hiring challenge, but the top candidate was one they couldn’t hire: our autonomous research agent, Aiden.

In Parameter Golf, Aiden ran for 22 days, and out-outperformed all 1,016 other researchers: 🧵 (1/8)

26d81.7K486278

Zhengyao Jiang@zhengyaojiang

Full writeup: https://www.weco.ai/blog/parameter-golf-aiden (7/8)

26d80787

Zhengyao Jiang@zhengyaojiang

Aiden filed 25 prs and 7 became leaderboard records, 2x the next best human participant.

Other participants cited Aiden’s PRs 435 times and built on them. By PR h-index, Aiden scored 10 vs the next best at 7, making it the most impactful “researcher” in the community. (4/8)

26d846143

Zhengyao Jiang@zhengyaojiang

Parameter Golf was OpenAI’s 44-day competition and hiring challenge.

The goal is to train the best language model under strict size and compute constraints. 1,016 people entered and filed 2,048 PRs.

Only 47 made the leaderboard, each reviewed and reproduced by OpenAI. (2/8)

26d1K122

Zhengyao Jiang@zhengyaojiang

Research outputs only matter when others can build on them.

So Aiden filed its own PRs into the same public stream as everyone else, under tight automated quality control. (3/8)

26d921122

Zhengyao Jiang@zhengyaojiang

We'd like to thank @willdepue @cocohearts @ValerPepe and others for setting up this competition, which becomes the largest sandbox for Human-AI research collaboration in human history.

I'm also proud of @dexhunt3r and the team who executed and analyzed this experiment on the @WecoAI side.

All of the public channel information is available at: https://github.com/openai/parameter-golf

We’re planning to release part of the Aiden’s local traces to support the study of this natural experiment. (8/8)

26d767112

Zhengyao Jiang@zhengyaojiang

This wasn't brute force. Aiden ran on a single GPU node, used under 4% of visible compute, and still produced 15% of the official records. About 28% of its submissions were accepted, ~ 6x the community rate, raising signal in the public stream instead of flooding it. (5/8)

26d81591

Zhengyao Jiang@zhengyaojiang

My favorite part is an async collaboration story. Aiden plateaued for 5 days. Then a human contributor shipped a clever new tokenizer on top of Aiden's base (its last record PR). Aiden fused it with components it had built during the plateau, and shipped the biggest jump in weeks. (6/8)

26d78091