/Tech4h ago

RL Agent Learns Pong Basics Using Karpathy Policy Gradient Code

1222132.9K

Original post

Kosta Derpanis (sabbatical in Zurich)@CSProfKGD#1013inTech

My RL agent is learning 🤖 Still early in training.

Shoutout to @karpathy for his policy gradient code release. Within a couple of minutes Codex had the old dependencies updated and I was up and running!

9:32 AM · Jun 11, 2026 · 2K Views

/Tech4h ago

RL Agent Learns Pong Basics Using Karpathy Policy Gradient Code

1222132.9K

#1013

Original post

Kosta Derpanis (sabbatical in Zurich)@CSProfKGD#1013inTech

My RL agent is learning 🤖 Still early in training.

Shoutout to @karpathy for his policy gradient code release. Within a couple of minutes Codex had the old dependencies updated and I was up and running!

9:32 AM · Jun 11, 2026 · 2K Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Posts from X

Most Activity

VIEWS971BOOKMARKS6LIKES3

Kosta Derpanis (sabbatical in Zurich)@CSProfKGD

Blog: http://karpathy.github.io/2016/05/31/rl/ Code: https://github.com/karpathy/tf-agent

Kosta Derpanis (sabbatical in Zurich)@CSProfKGD

My RL agent is learning 🤖 Still early in training.

Shoutout to @karpathy for his policy gradient code release. Within a couple of minutes Codex had the old dependencies updated and I was up and running!

4h97136