Tech · 37d ago

Joseph Suarez says Pufferlib 5.0 will be a short update, coming soon, to the library for training neural networks on Atari games and continuous action space problems with a CUDA GPU

Pre-trained models run in WebAssembly via the project site.


#403

Original post

kache@yacineMTB · #403 in Tech

Pufferlib is insane. You can train neural networks to play games out of the box if you have a CUDA GPU. Like breakout, Atari games, continuous action space problems. You can go to the website right now and they have neural nets running in wasm

6:30 AM · May 23, 2026 · 17.1K Views

Sentiment

Many users praise Pufferlib for simplifying RL training on Atari games via CUDA by treating env throughput as first-class and easing the learning curve, while some dismiss it over GPU requirements and skepticism about hardware scaling.

Pos: 80.0%

Neg: 20.0%

7 comments with sentiment.


Posts from X

Most Activity

Views 10.8K · Bookmarks 55 · Likes 163 · Retweets 2 · Replies 6

kache@yacineMTB

This week I'm going to do some research using pufferlib for fun. My intention isn't to push the needle on algorithms, I still have much to learn. I'm mostly trying to share The Good News. Pufferlib has been indispensable for me, for learning more about RL, and properties of NNs


35d · 10.8K views · 163 likes · 55 bookmarks

Joseph Suarez 🐡@jsuarez

@yacineMTB + 5.0 will be a short update - coming soon!

kache@yacineMTB

37d1.3K212

Saylor@seylorra

@yacineMTB wasm + neural nets feels illegal on paper but it's such a natural evolution

did u test the built in atari envs yet?

37d23411

Zack's Lab@zackslab

@yacineMTB what kind of GPU do you need to do meaningful stuff in a reasonable amount of time?

what does your setup look like for what you're doing with the drone stuff?

also, what's the break even on your own rig vs just renting H100 hours in the cloud?

37d4261

Nate Brown@ntbrown01

@zackslab @yacineMTB If you can get an H100… about $4/hr. So it’ll take you 500h or about 20 days to spend $2000, or enough for an RTX 3090.

37d721

Nate Brown@ntbrown01

@zackslab @yacineMTB An H100 is like $40k before you get all the infra to run it. But just assuming $40k, it’ll take 416 days to break even on the card itself.

It just depends on your situation, if using the card is fundamental to your revenue stream or you just need it to crank out a model.

37d411
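The rent-versus-buy figures in the two replies above can be checked with a few lines of arithmetic. This is a rough sketch that uses only the numbers quoted in the thread (about $4/hr for a rented H100, $2000 for an RTX 3090, $40k for an H100 card). The `break_even` helper is a hypothetical name, and the calculation assumes round-the-clock rental while ignoring power, infrastructure and resale value.

```python
# Back-of-the-envelope rent-vs-buy check using the figures quoted in the thread.
# Assumptions (not from the posts): 24/7 utilization; power, hosting and
# resale value are ignored.
H100_RENTAL_USD_PER_HOUR = 4.0   # "about $4/hr"
RTX_3090_PRICE_USD = 2_000.0     # "$2000, or enough for an RTX 3090"
H100_PRICE_USD = 40_000.0        # "like $40k", card only


def break_even(purchase_usd, rental_usd_per_hour):
    """Return (hours, days) of continuous rental that cost as much as buying."""
    hours = purchase_usd / rental_usd_per_hour
    return hours, hours / 24.0


rtx_hours, rtx_days = break_even(RTX_3090_PRICE_USD, H100_RENTAL_USD_PER_HOUR)
h100_hours, h100_days = break_even(H100_PRICE_USD, H100_RENTAL_USD_PER_HOUR)

print(f"RTX 3090 budget: {rtx_hours:.0f} h = {rtx_days:.1f} days of H100 rental")
print(f"H100 purchase:   {h100_hours:.0f} h = {h100_days:.1f} days of H100 rental")
```

Under those assumptions the results match the posts: 500 hours (about 20 days) to spend an RTX 3090's price on rental, and roughly 416 days to spend the price of the H100 itself. The comparison says nothing about how much work each card completes per hour, which is the question raised in the next reply.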

Zack's Lab@zackslab

@ntbrown01 @yacineMTB yeah but i assume a single 3090 is hardly enough to do the same work you’d have done on those first 500 hrs of an H100?

how many 3090s would it take to have the equivalent training capabilities as one H100?

37d75

Orion_seeks@Orion_seeks

@yacineMTB Thanks for puffer pilling me. Been messing with Neural MMO and customized Ocean envs

35d851

Patrick Kuhnke@ku_ds17868

@yacineMTB I also wanna explore the wasm llm stuff -- just implemented the onnx / llama.cpp pipelines in jai for text-embedding -- wasm next?

37d217

thomas@brainage19

@yacineMTB One day if I get bored I might try and train one to play Minecraft and see if it can do better than stuff like Baritone

37d163

Omar@kouhxp

@yacineMTB I stopped reading after CUDA GPU

37d119

Zack's Lab@zackslab

@ntbrown01 @yacineMTB yeah I’m realizing there is a lot more nuance to this depending on what you’re doing.

37d351

Arison@0x_Arison

@yacineMTB good tools like this make the rl learning curve actually manageable

35d86

Team Reagent@Reagent_Systems

@yacineMTB it really is the coolest thing to come out for training in recent memory

37d82

dougvk@dougvk

@yacineMTB but it's paid, no?

37d70

Satoshi Nakamoto, Andrew Rulnick@MickeySteamboat

@yacineMTB I keep telling everyone the future is 100% PHP WASM and they think that I'm joking...

37d56

kev @AAAI@bad_at_ai

@yacineMTB If you could document and annotate your steps to share, that would be very nice. I tried once, felt stuck at a very early stage, and gave up :)

35d41

dionysup@countkc

@yacineMTB please write an article on it!

35d121

ayan@behindceo

@yacineMTB moore's law ended and now we just rent bigger gpus and pretend it's progress

37d21

Aimar Haddadi@AdvicebyAimar

@yacineMTB i have to check this out

37d14