/AI3h ago

Builder Runs MuJoCo RL Training Loop For Unitree Robot At 200k SPS In Browser

3143886126.6K
Original post
kache@yacineMTB#488inAI

I've gotten a mujoco sim RL training loop for a unitree robot at 200k SPS. I'm looking into the physics for friction, contact dynamics. My goal: can I reproduce & beat the mujoco playground RL baselines

This is running in my web browser with raylib. Its the baseline

6:54 AM · Jun 10, 2026 · 14.5K Views
Sentiment

Users liked the realistic robot simulation demo for its speed and collision accuracy but faulted AI companies for misleading shareholders.

Pos
62.5%
Neg
37.5%
12 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS8.1KBOOKMARKS22LIKES178RETWEETS2REPLIES13
kache@yacineMTB

I unfortunately can't use any anthropic models for this, because there is no way for me to know whether I'm silently getting nerfed & sandbagged. Not allowed to make the world a better place

Not enough that they're closed, open source devs need to be slowed down too

kache@yacineMTB

I've gotten a mujoco sim RL training loop for a unitree robot at 200k SPS. I'm looking into the physics for friction, contact dynamics. My goal: can I reproduce & beat the mujoco playground RL baselines

This is running in my web browser with raylib. Its the baseline

2hViews 8.1KLikes 178Bookmarks 22
kache@yacineMTB

My suspicion is that the contact dynamics, code, is not at all optimized for ultra fast RL. The only reason it is as slow as it is is because no one really tried, and I'm going to try

This is all going to be open source. It's a project. My goal is to accelerate robotics

kache@yacineMTB

I've gotten a mujoco sim RL training loop for a unitree robot at 200k SPS. I'm looking into the physics for friction, contact dynamics. My goal: can I reproduce & beat the mujoco playground RL baselines

This is running in my web browser with raylib. Its the baseline

3hViews 2.5KLikes 67Bookmarks 1
kache@yacineMTB

The problem with the "frontier" distinction is that it is laughably easy to be frontier - there are so many green fields not yet explored. It's like saying you're at the frontier in the middle of kansas. There's fertile ground everywhere.

kache@yacineMTB

I unfortunately can't use any anthropic models for this, because there is no way for me to know whether I'm silently getting nerfed & sandbagged. Not allowed to make the world a better place

Not enough that they're closed, open source devs need to be slowed down too

2hViews 1.6KLikes 30Bookmarks 0
kache@yacineMTB

Particularly, reinforcement learning for robotics, that anyone can do with a single GPU at home. It's going to be CUDA only (thanks jensen..). I am using 3x 4090s, but each training baseline will run on a single 4090.

0 python will be used

3hViews 801Likes 12
kache@yacineMTB

I'm going to base my work on top of pufferlib and follow along their development. They are scarily good at making environments for all kinds of tasks - I've done remarkable things with their RL loop. They're taking contracts if you think you have a problem they can help with

2hViews 349Likes 8
lycaon@lyc_aon

@yacineMTB this guy walkin dumb as hayl

2hViews 36Likes 3
Omar@kouhxp

@yacineMTB you go to a restaurant order a ribeye, but chef swaps it for the cheapest cut because it's good for you, but still pay for a ribeye, and the kitchen has tinted windows

2hViews 32
Santi@__selewaut__

@yacineMTB You're cooking

2hViews 5Likes 1
NCD@Jokerbernrin

@yacineMTB 200k SPS is impressive already

2hViews 23Likes 1
dmsimon@dmsimon

@yacineMTB Man, all they're trying to do is add shareholder value.

2hViews 17Likes 1
Revxnge@myisdimitry

@ChaseBrowe32432 @yacineMTB Ah yes so we should just accept that they SELL a product you PAY for that intentionally sabotages you without you knowing in some cases? Keep in mind Claude is also heavily targeted at enterprises. It’s not a free private use product where this might be more defensible.

2hViews 13Likes 1
Nosredna@Nosredna

@yacineMTB You're pretty good at analogies. I like this one a lot.

2hViews 11Likes 1

@lyc_aon @yacineMTB This part is a bit too on the nose 😂

How it feels to try to do some perfectly fine future shit and the model says no

2hViews 7Likes 1
mendez@justmrmendez

I started working on this a while back. but since time is money, I was broke, if this can be solved, it will absolutely scale all your robotic research: 🎁

CoVeRT: Constraint Valid Environment Resampling and Tracking for synthetic environment generation

The system is basically a validity layer for generated simulation states.

Start with one working task scene. It extracts the contact graph, support graph, reachability bounds, joint limits, object affordances, task constraints, collision margins, closest point distances, and penetration depth. Then it generates new scenes by changing poses, clutter, friction, mass, geometry, dynamics, or layout, but only accepts states that still obey the world rules and the task.

So one clean setup can become thousands of usable states: curricula, edge cases, near failures, labels, validity scores, and rejection reports for the broken ones.

This applies beyond robotics. Any 2D or 3D simulation where an AI learns from generated worlds.

Plug that into your harness and have it do some research.

1hViews 15
FunktastiqueP@FunktastiqueP

@yacineMTB If they don't nerf stuff their govt contracts will be at risk. The Trump admin is no joke

2hViews 15
Santi@__selewaut__

@yacineMTB Me: scrapping excel files with Claude 🤡

2hViews 12
RobitOverload@10_X_eng

@yacineMTB What are you using the run the simulations? Do you have a custom framework you are working from or an off the shelf tool for it?

2hViews 12
Paweł J Lisowski@PawelJLisowski

@yacineMTB And thats just beginning. As they grow and control more expect even more of this crazed attitude.

Thats why need more support for open models. Need some actual foundation for training and improving open weight models.

2hViews 7
technohobo@aeither2317

@yacineMTB i look forward to seeing this. i am bored and need a challenge

2hViews 4
Load more posts