/Tech14h ago

Researcher Runs Autonomous AI Agents To Predict Knicks Game Outcome

59215.9K

#1999

Original post

Ravid Shwartz Ziv@ziv_ravid#1999inTech

🏀🤖 There was lovely weather this weekend, so I let my agents run without babysitting. I combined the two things all my feeds are obsessed with right now: the New York Knicks and autonomous research agents. So, who will win the game today? 🧵

8:11 AM · Jun 8, 2026 · 5.2K Views

Ravid Shwartz Ziv@ziv_ravid

The Knicks are up 2–0 on the Spurs in the 2026 Finals. Game 3 is tonight at MSG. Instead of a hot take, I ran an experiment: can an autonomous AI research agent (in the style of @karpathy's "autoresearch") build and tune the prediction model by itself? 👀

14h9913

Ravid Shwartz Ziv@ziv_ravid

The idea: apply Karpathy's autoresearch loop (let an LLM edit the training code, run a short experiment, keep the change only if a metric improved, and repeat) and point it at tonight's game.

14h4871

kobim@_kobim

@ziv_ravid A layman's question (as usual): so this autoresearch is simply automation of iterations over models?

14h2893

Ravid Shwartz Ziv@ziv_ravid

I feel that this tweet thread didn't get enough credit. TL;DR: My agents built a prediction model that predicts the Knicks will win today's game.

Ravid Shwartz Ziv@ziv_ravid

11h66510

Ravid Shwartz Ziv@ziv_ravid

So, what's the model predicting for Game 3? 🏀 Knicks 108–105, Knicks 59% to win.

That sits right between Vegas (57%) and a separate Monte Carlo "blend" model (68%).

Player projections:
• Brunson ~27 pts / 7 ast (±8)
• Towns ~18 / 11 reb
• Wembanyama ~25 / 11 reb (±9)
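One way to see how a 3-point predicted margin and a 59% win probability can fit together is a quick sketch. The thread does not say how the 59% was derived, so the normal-distribution assumption and the helper names `win_probability` and `implied_sigma` are mine, not the author's.

```python
from statistics import NormalDist

def win_probability(predicted_margin, sigma):
    """P(actual margin > 0), ASSUMING the margin is normally distributed
    around the prediction with standard deviation sigma (in points)."""
    return 1.0 - NormalDist(mu=predicted_margin, sigma=sigma).cdf(0.0)

def implied_sigma(predicted_margin, win_prob):
    """Invert the same assumption: which sigma makes the two numbers agree?"""
    return predicted_margin / NormalDist().inv_cdf(win_prob)

# Numbers from the thread: Knicks 108-105 (margin +3) and 59% to win.
sigma = implied_sigma(3.0, 0.59)
print(f"implied sigma: {sigma:.1f} points")
print(f"round trip:    {win_probability(3.0, sigma):.2f}")
```

Under this assumption the two numbers are consistent with a game-to-game spread of roughly 13 points, which is why a 3-point edge is only a modest favorite.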

14h8511

Ravid Shwartz Ziv@ziv_ravid

What's the baseline? We compare to Vegas - the sportsbook's closing line (the point spread and total set right before tip-off). It already prices in injuries, lineups, rest, and sharp-money opinion, which makes it the gold-standard public predictor of a game.
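A minimal sketch of what comparing against the closing line could look like. The thread reports "error" without naming the metric, so treating it as mean absolute error on the point margin is an assumption, and every number below is a made-up toy value rather than real game data.

```python
def margin_mae(predicted, actual):
    """Mean absolute error between predicted and actual home margins (points)."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

def winner_accuracy(predicted, actual):
    """Share of games where the predicted margin has the right sign."""
    hits = sum((p > 0) == (a > 0) for p, a in zip(predicted, actual))
    return hits / len(actual)

# Toy data only. Positive = home team wins by that many points.
# A sportsbook spread would first need converting to this sign convention.
actual_margin = [5, -3, 12, -8, 2]
model_margin = [3, 1, 7, -10, -1]
vegas_margin = [4, -2, 9, -6, 1]

for name, preds in (("model", model_margin), ("vegas", vegas_margin)):
    mae = margin_mae(preds, actual_margin)
    acc = winner_accuracy(preds, actual_margin)
    print(f"{name}: margin error {mae:.1f}, winner accuracy {acc:.0%}")
```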

14h47

Ravid Shwartz Ziv@ziv_ravid

The results? Much better! 🚀 Result: holdout error 9.6 - almost matching Vegas. 🤝

14h45

Ravid Shwartz Ziv@ziv_ravid

My friend CC and I started by scraping 6 full seasons (2021 → 2026), including the playoffs (date, teams, scores, playoff flag). For each experiment, the agent proposes changes to the config, runs it, compares the error, keeps the changes if the error improved, then repeats.
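The propose / run / compare / keep cycle can be sketched as a greedy loop. This is only an illustration: `evaluate` is a toy error function standing in for a real training run, and `propose` is a random perturbation standing in for the LLM agent. Both names and the config keys are hypothetical.

```python
import random

def evaluate(config):
    """Hypothetical stand-in for 'train briefly, return dev-set error'.
    Toy surface with its minimum at learning_rate=0.05, max_depth=4."""
    return abs(config["learning_rate"] - 0.05) * 100 + abs(config["max_depth"] - 4)

def propose(config, rng):
    """Hypothetical stand-in for the agent: change one setting at random."""
    candidate = dict(config)
    if rng.random() < 0.5:
        scaled = config["learning_rate"] * rng.choice([0.5, 2.0])
        candidate["learning_rate"] = max(0.001, scaled)
    else:
        candidate["max_depth"] = max(1, config["max_depth"] + rng.choice([-1, 1]))
    return candidate

def autoresearch(config, n_experiments, seed=0):
    rng = random.Random(seed)
    best_error = evaluate(config)
    history = [(best_error, True)]  # (error, kept?) per experiment
    for _ in range(n_experiments):
        candidate = propose(config, rng)
        error = evaluate(candidate)
        kept = error < best_error  # keep the change only if the metric improved
        if kept:
            config, best_error = candidate, error
        history.append((error, kept))
    return config, best_error, history

start = {"learning_rate": 0.4, "max_depth": 8}
best_config, best_error, history = autoresearch(start, n_experiments=200)
print(best_config, round(best_error, 3))
```

Because a change is kept only on strict improvement, the best error can never get worse; the price is that the loop can stall in a local optimum.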

14h40

Ravid Shwartz Ziv@ziv_ravid

I used the first 4 seasons as training data, 2024-25 as the dev season the agent iterates on, and the final season (2025-26) as a locked holdout - scored exactly once, at the very end. No peeking.
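A chronological split like this takes only a few lines. In the sketch below the row format and the function name `split_by_season` are assumptions, and the toy rows carry placeholder scores, not real results.

```python
def split_by_season(games):
    """Earliest seasons -> train, second-to-last -> dev, last -> holdout."""
    seasons = sorted({g["season"] for g in games})
    if len(seasons) < 3:
        raise ValueError("need at least three seasons")
    *train_seasons, dev_season, holdout_season = seasons
    train = [g for g in games if g["season"] in train_seasons]
    dev = [g for g in games if g["season"] == dev_season]
    holdout = [g for g in games if g["season"] == holdout_season]
    return train, dev, holdout

# "2024-25" and "2025-26" come from the thread; the four earlier labels are
# ASSUMED from "6 seasons ending in 2025-26". Scores are placeholders.
labels = ("2020-21", "2021-22", "2022-23", "2023-24", "2024-25", "2025-26")
games = [
    {"season": s, "home": "AAA", "away": "BBB",
     "home_pts": 100, "away_pts": 98, "playoff": False}
    for s in labels
]

train, dev, holdout = split_by_season(games)
print(len(train), dev[0]["season"], holdout[0]["season"])
```

Splitting by season rather than at random keeps future games out of training, and leaving `holdout` untouched inside the search loop is what makes the final score an honest one.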

14h39

Ravid Shwartz Ziv@ziv_ravid

So how good is the model, and how does the optimization actually look? Every dot is one candidate config (green = kept, hollow = rejected). The teal line tracks the best so far. You can literally watch the agent learn.
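The "best so far" line in a plot like this is a running minimum over the candidate errors. A small sketch with made-up error values (the plot's actual numbers are not in the thread):

```python
from itertools import accumulate

# Made-up dev errors for seven candidate configs, in the order they were tried.
errors = [12.0, 12.3, 11.8, 11.9, 11.5, 11.5, 11.2]

# Running minimum: the line tracking the best result so far.
best_so_far = list(accumulate(errors, min))

# A candidate counts as kept if it strictly beats the best before it;
# the first one is the baseline and is kept by definition.
kept = [i == 0 or e < best_so_far[i - 1] for i, e in enumerate(errors)]

for e, b, k in zip(errors, best_so_far, kept):
    print(f"{e:5.1f}  best={b:5.1f}  {'kept' if k else 'rejected'}")
```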

14h39

Ravid Shwartz Ziv@ziv_ravid

Starting from a plain gradient-boosting baseline, the agent found:
* pre-game Elo ratings (with margin-of-victory updates)
* back-to-back / fatigue flags
* Boosting is all you need.

Result: holdout error 11.7 → 11.4, winner accuracy 66% → 69% (above Vegas's 67%).
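For readers unfamiliar with the first feature, here is a rough sketch of pre-game Elo with a margin-of-victory multiplier. The thread does not say which Elo variant the agent used, so the constants (`K`, `HOME_ADVANTAGE`), the multiplier formula, and the function names are all illustrative assumptions.

```python
K = 20.0                # assumed update step
HOME_ADVANTAGE = 100.0  # assumed home-court bonus, in Elo points

def expected_home_win(home_elo, away_elo):
    diff = home_elo + HOME_ADVANTAGE - away_elo
    return 1.0 / (1.0 + 10.0 ** (-diff / 400.0))

def update_elo(home_elo, away_elo, home_pts, away_pts):
    """Return post-game (home, away) ratings; bigger wins move ratings more."""
    expected = expected_home_win(home_elo, away_elo)
    home_won = home_pts > away_pts
    actual = 1.0 if home_won else 0.0
    margin = abs(home_pts - away_pts)
    diff = home_elo + HOME_ADVANTAGE - away_elo
    winner_diff = diff if home_won else -diff
    # Assumed multiplier: grows with the margin, shrinks when the favorite wins.
    multiplier = (margin + 3.0) ** 0.8 / (7.5 + 0.006 * winner_diff)
    delta = K * multiplier * (actual - expected)
    return home_elo + delta, away_elo - delta

def pregame_elo_features(games, initial=1500.0):
    """games: (home, away, home_pts, away_pts) tuples in chronological order."""
    ratings, rows = {}, []
    for home, away, home_pts, away_pts in games:
        h, a = ratings.get(home, initial), ratings.get(away, initial)
        rows.append((h, a))  # recorded BEFORE the result is applied
        ratings[home], ratings[away] = update_elo(h, a, home_pts, away_pts)
    return rows, ratings

# Toy schedule with placeholder team codes and scores.
rows, ratings = pregame_elo_features([("AAA", "BBB", 110, 100), ("BBB", "AAA", 95, 99)])
print(rows, ratings)
```

Recording each rating before applying the game's result is what keeps the feature "pre-game": the model never sees information from the game it is trying to predict.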

14h39

Ravid Shwartz Ziv@ziv_ravid

Good, but not perfect. So we tried using more data. I won't go through every detail, but in general, we added injury / availability data (who's actually playing tonight), player ratings + projected lineups, richer per-game box scores, and player tracking / shot data.

14h37

Ravid Shwartz Ziv@ziv_ravid

It's important to note: at this stage the model does NOT beat Vegas on margin error. What's missing? The closing line aggregates thousands of sharp bettors pricing in injuries, lineups, rest, minutes limits, and late news - info known only hours before tip.

14h37

Ravid Shwartz Ziv@ziv_ravid

@_kobim Sort of. It simply chooses which parameters to run, automatically.

14h662

Ariel Noyman@relnox

@ziv_ravid My bagel is Jewish, Knicks in four, the loop runs in Claude, I removed the code editor

13h1101

Ravid Shwartz Ziv@ziv_ravid

Of course, this is not betting advice: single NBA games are that random. That's the entire point. 🏀🤖

Go Knicks! 🧡💙

14h181

Roy Zuckerman@ZuckermanRoy

@ziv_ravid OK. I'm going to go with 119-99 Knicks. That's what you get if you exclude regular season and run the playoff results only.

13h26