/AI3h ago

Builder Runs MuJoCo RL Training Loop For Unitree Robot At 200k SPS In Browser

3143886126.6K

Original post

kache@yacineMTB#488inAI

I've gotten a mujoco sim RL training loop for a unitree robot at 200k SPS. I'm looking into the physics for friction, contact dynamics. My goal: can I reproduce & beat the mujoco playground RL baselines

This is running in my web browser with raylib. Its the baseline

6:54 AM · Jun 10, 2026 · 14.5K Views

/AI3h ago

Builder Runs MuJoCo RL Training Loop For Unitree Robot At 200k SPS In Browser

3143886126.6K

#488

Original post

kache@yacineMTB#488inAI

This is running in my web browser with raylib. Its the baseline

6:54 AM · Jun 10, 2026 · 14.5K Views

Sentiment

Users liked the realistic robot simulation demo for its speed and collision accuracy but faulted AI companies for misleading shareholders.

Pos

62.5%

Neg

37.5%

12 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS8.1KBOOKMARKS22LIKES178RETWEETS2REPLIES13

kache@yacineMTB

I unfortunately can't use any anthropic models for this, because there is no way for me to know whether I'm silently getting nerfed & sandbagged. Not allowed to make the world a better place

Not enough that they're closed, open source devs need to be slowed down too

kache@yacineMTB

This is running in my web browser with raylib. Its the baseline

2h8.1K17822

kache@yacineMTB

My suspicion is that the contact dynamics, code, is not at all optimized for ultra fast RL. The only reason it is as slow as it is is because no one really tried, and I'm going to try

This is all going to be open source. It's a project. My goal is to accelerate robotics

kache@yacineMTB

This is running in my web browser with raylib. Its the baseline

3h2.5K671

kache@yacineMTB

The problem with the "frontier" distinction is that it is laughably easy to be frontier - there are so many green fields not yet explored. It's like saying you're at the frontier in the middle of kansas. There's fertile ground everywhere.

kache@yacineMTB

I unfortunately can't use any anthropic models for this, because there is no way for me to know whether I'm silently getting nerfed & sandbagged. Not allowed to make the world a better place

Not enough that they're closed, open source devs need to be slowed down too

2h1.6K300

kache@yacineMTB

Particularly, reinforcement learning for robotics, that anyone can do with a single GPU at home. It's going to be CUDA only (thanks jensen..). I am using 3x 4090s, but each training baseline will run on a single 4090.

0 python will be used

3h80112

kache@yacineMTB

I'm going to base my work on top of pufferlib and follow along their development. They are scarily good at making environments for all kinds of tasks - I've done remarkable things with their RL loop. They're taking contracts if you think you have a problem they can help with

2h3498

lycaon@lyc_aon

@yacineMTB this guy walkin dumb as hayl

2h363

Omar@kouhxp

@yacineMTB you go to a restaurant order a ribeye, but chef swaps it for the cheapest cut because it's good for you, but still pay for a ribeye, and the kitchen has tinted windows

2h32

Santi@__selewaut__

@yacineMTB You're cooking

2h51

Shamanic Depressive Gymnosophist (book in bio)@sigilante

@yacineMTB I love how when he falls only the feet have collision bounds.

3h312

NCD@Jokerbernrin

@yacineMTB 200k SPS is impressive already

2h231

dmsimon@dmsimon

@yacineMTB Man, all they're trying to do is add shareholder value.

2h171

Revxnge@myisdimitry

@ChaseBrowe32432 @yacineMTB Ah yes so we should just accept that they SELL a product you PAY for that intentionally sabotages you without you knowing in some cases? Keep in mind Claude is also heavily targeted at enterprises. It’s not a free private use product where this might be more defensible.

2h131

Nosredna@Nosredna

@yacineMTB You're pretty good at analogies. I like this one a lot.

2h111

Artificially Inclined™@Art_If_Ficial

@lyc_aon @yacineMTB This part is a bit too on the nose 😂

How it feels to try to do some perfectly fine future shit and the model says no

2h71

mendez@justmrmendez

I started working on this a while back. but since time is money, I was broke, if this can be solved, it will absolutely scale all your robotic research: 🎁

CoVeRT: Constraint Valid Environment Resampling and Tracking for synthetic environment generation

The system is basically a validity layer for generated simulation states.

Start with one working task scene. It extracts the contact graph, support graph, reachability bounds, joint limits, object affordances, task constraints, collision margins, closest point distances, and penetration depth. Then it generates new scenes by changing poses, clutter, friction, mass, geometry, dynamics, or layout, but only accepts states that still obey the world rules and the task.

So one clean setup can become thousands of usable states: curricula, edge cases, near failures, labels, validity scores, and rejection reports for the broken ones.

This applies beyond robotics. Any 2D or 3D simulation where an AI learns from generated worlds.

Plug that into your harness and have it do some research.

1h15

FunktastiqueP@FunktastiqueP

@yacineMTB If they don't nerf stuff their govt contracts will be at risk. The Trump admin is no joke

2h15

Santi@__selewaut__

@yacineMTB Me: scrapping excel files with Claude 🤡

2h12

RobitOverload@10_X_eng

@yacineMTB What are you using the run the simulations? Do you have a custom framework you are working from or an off the shelf tool for it?

2h12

Paweł J Lisowski@PawelJLisowski

@yacineMTB And thats just beginning. As they grow and control more expect even more of this crazed attitude.

Thats why need more support for open models. Need some actual foundation for training and improving open weight models.

2h7

technohobo@aeither2317

@yacineMTB i look forward to seeing this. i am bored and need a challenge

2h4