1d ago

Neural Network Scaling Expands Beyond Money Using Clever Data Generators

0
Original post

Scaling neural networks is not a function of purely money. There are all these other dimensions which you can scale that people don't really pay attention to because spending money is the easier thing. But if you don't have money you can scale data, using clever data generators

5:19 AM · May 18, 2026 View on X

You can also scale the total amount of training by just stacking compute multipliers in ways that are not immediately obvious. We have all of these incredible data generators in video games that are now easily decompiled by LLMs. Have you thought about pushing that to the limit?

kachekache@yacineMTB

Scaling neural networks is not a function of purely money. There are all these other dimensions which you can scale that people don't really pay attention to because spending money is the easier thing. But if you don't have money you can scale data, using clever data generators

12:19 PM · May 18, 2026 · 7.1K Views
12:20 PM · May 18, 2026 · 2.1K Views

If you extract the state_i to state_i+1 of a video game, and autonomously ruthlessly optimize that while verifying that the state replay matches the real game how much data could you generate? How many frames per second or states per second could a single CPU core generate?

kachekache@yacineMTB

You can also scale the total amount of training by just stacking compute multipliers in ways that are not immediately obvious. We have all of these incredible data generators in video games that are now easily decompiled by LLMs. Have you thought about pushing that to the limit?

12:20 PM · May 18, 2026 · 2.1K Views
12:21 PM · May 18, 2026 · 1.6K Views

We are late game in language models but we're still not late game in neural networks in general. As as our technology improves, we are unlocking all of this experimentation that was not worth doing. If you're just a little bit creative you'll be able to do some goofy stuff

kachekache@yacineMTB

If you extract the state_i to state_i+1 of a video game, and autonomously ruthlessly optimize that while verifying that the state replay matches the real game how much data could you generate? How many frames per second or states per second could a single CPU core generate?

12:21 PM · May 18, 2026 · 1.6K Views
12:22 PM · May 18, 2026 · 581 Views

You can build your own hardware at the limit you can have a bunch of lidar modules on a 3D printed harness and a stereoscopic camera and you can take that data and then train a very small model that produces the depth. It's very very easy a 4090 is a supercomputer

kachekache@yacineMTB

We are late game in language models but we're still not late game in neural networks in general. As as our technology improves, we are unlocking all of this experimentation that was not worth doing. If you're just a little bit creative you'll be able to do some goofy stuff

12:22 PM · May 18, 2026 · 581 Views
12:25 PM · May 18, 2026 · 2.1K Views

Would cost you probably less than $100 to build this is like a raspberry pi Pico attached to some $5 aliexpress modules. I wondered myself why no one actually does stuff like this and I think it's because they don't understand how things work they don't understand how easy it is

kachekache@yacineMTB

You can build your own hardware at the limit you can have a bunch of lidar modules on a 3D printed harness and a stereoscopic camera and you can take that data and then train a very small model that produces the depth. It's very very easy a 4090 is a supercomputer

12:25 PM · May 18, 2026 · 2.1K Views
12:25 PM · May 18, 2026 · 2.1K Views