/AI13h ago

Mechanize founder Yacine hits 300,000 X followers, jokingly attributing his growth to solving a five-pendulum cartpole task

His audience grew from 600 followers one year ago.

2772.1K47267110.3K
Original post
kache@yacineMTB#488inAI

I'm at three hundred thousand followers today. Pretty meteoric growth from a year ago, when I was only at 600 followers. I started posting my projects on this site, and then roon followed me. Then everything changed. Now, I'm famous for solving 5 pendulums. I might even solve 6!

7:13 AM · Jun 8, 2026 · 28.4K Views
Sentiment

Many users congratulated the builder on the first six-pendulum cartpole solve with AI for its technical achievement, while a few dismissed the claims as impossible or exaggerated.

Pos
70.7%
Neg
29.3%
50 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS55.5KBOOKMARKS223LIKES955RETWEETS41REPLIES112
kache@yacineMTB

behold. THE WORLDS FIRST SIX PENDULUM CARTPOLE SOLVE. Including a sponsor!

To solve this task, I built an environment to train an AI. This is what mechanize does, but for larger AIs. Apply! Salaries are up on their page

Thank you to mechanize for sponsoring!

2hViews 55.5KLikes 955Bookmarks 223
kache@yacineMTB

i actually can't believe i was the first person to solve 6 pendulum cartpole that's crazy

2hViews 10KLikes 188Bookmarks 10
kache@yacineMTB

Oh no .... I better solve 6 pendulums fast before it's too late and someone else does

3hViews 9.9KLikes 154Bookmarks 8
kache@yacineMTB

This took me since last Thursday to solve. I had it solved by this morning. I'm only posting it this late in the evening because i had to learn blender.

http://mechanize.work/apply

Rest of the thread is how I solved it (it was the dumbest way possible)

kache@yacineMTB

behold. THE WORLDS FIRST SIX PENDULUM CARTPOLE SOLVE. Including a sponsor!

To solve this task, I built an environment to train an AI. This is what mechanize does, but for larger AIs. Apply! Salaries are up on their page

Thank you to mechanize for sponsoring!

2hViews 7.1KLikes 130Bookmarks 10
kache@yacineMTB

AI expert (unemployed)

46mViews 3.6KLikes 102Bookmarks 5
Wei Wu 吴伟@WuWei113

@yacineMTB You famous because I make you be famous. Never forget who the mother for your famous be.

12hViews 2.4KLikes 78
kache@yacineMTB

real machine learning for robotics hasn't been tried. no one has thought carefully about what the simulator does. what the distribution of real life is. where the bottlenecks are and where the shortcuts are

puffer is going to make RL faster and faster. The only limit is the env!

2hViews 1.9KLikes 45Bookmarks 3
kache@yacineMTB

So once the model started scoring well enough that it learned the whipping behavior, but struggled to keep it up longer than 10 seconds, I increased and randomized the episode length per episode

Just a dumb trick I found experimentally to make these little rnns behave better

2hViews 664Likes 16Bookmarks 5
kache@yacineMTB

I solved this by blasting the task in RL. Each dot here is an individual experiment with its own set of hyperparameters, trained in pufferPPO. Pufferlib is the fastest, by wallclock, RL training loop I've found. X axis is wallclock, Y axis is "score"

2hViews 642Likes 17Bookmarks 4
kache@yacineMTB

I have some time now (i'm looking for a job, that's actually how I closed the mechanize sponsorship deal 🤪). So I'm going to spend the rest of the week standing up existing robotics simulators w/ fast RL for others

2hViews 1.4KLikes 38Bookmarks 1
gfodor.id@gfodor

@yacineMTB At this point it’s like taking credit for your kid’s accomplishments. The computer figures the stuff out now

kache@yacineMTB

i actually can't believe i was the first person to solve 6 pendulum cartpole that's crazy

1hViews 2.5KLikes 27Bookmarks 1
kache@yacineMTB

i mean as far as i can tell i was

kache@yacineMTB

i actually can't believe i was the first person to solve 6 pendulum cartpole that's crazy

2hViews 3KLikes 34Bookmarks 0
kache@yacineMTB

That kind of gets me to how or why this is possible in the first place. This trains at 18m SPS on some configs with mujoco - I'm using mujoco warp.

I used APIC (API capture) to capture the cudagraph of the task, and make it callable from C. Speed is of utmost importance

2hViews 627Likes 10Bookmarks 2
kache@yacineMTB

I own a few GPUs, 4090s. I'm training relatively small models, puffer mingru. The policy I'm showing off is ~1m params. You set up an environment and a reward function. It's a bit of an art; here, you see the top right chart representing the reward. This is the training signal

2hViews 510Likes 16Bookmarks 1
hayden@haydendevs

@yacineMTB I swear you just went from 180 to 300 in like a month

11hViews 331Likes 16
kache@yacineMTB

You learn by experimenting. Shaping reward, helping it along to have the right behaviour, figuring out what it can and can't learn. These models have surprised me, being trained in RL. If you just hold them right.. you can make them do remarkable things

2hViews 648Likes 15
kache@yacineMTB

The thing that finally made this work was grabbing one of the top scoring hypers on the higher compute runs picked by the GP - and tweaking the task ever so slightly. One things I've noticed about these models is that if episodes end at the same time, they get.. lazy

2hViews 484Likes 10Bookmarks 1
kache@yacineMTB

18m steps per second is ridiculously fast compared to what is in the literature. I saw 90k sps mentioned as fast today. That's so slow...

People are doing VLA shaped dead ends for robotics because they just don't have the software infra for RL

I ran 3.6k experiments for this!

2hViews 589Likes 9Bookmarks 1
Theodore Keloglou@torchandzen

@yacineMTB what does solving pendulums mean?

12hViews 467Likes 3Bookmarks 1
Mihura@XMihura

@yacineMTB congrats nice job

5 pendulums is impressive

9hViews 928Likes 4Bookmarks 1
Load more posts