Everyone in Embodied AI is talking about Vision-Language-Action (VLA) models. Almost no one is talking about the physical nightmare of collecting the data to train them. You can't scrape a kitchen table or a warehouse shelf from a web browser.
To get to millions of hours of diverse, real-world manipulation data, you need hundreds of rigs. But you can't buy them. If you build them out of off-the-shelf developer kits, they weigh 15 pounds, overheat in an hour, require bulky cabling, and break the first time an operator wears them on a job site.
At the Instawork Robotics Lab (IRL), we had to build our own. Meet the Instacore: a rugged, 4lb wearable egocentric data-capture engine designed specifically to survive a full shift on a standard power pack.
We didn't build a flashy humanoid. We did hard, blue-collar systems engineering:
💾 THE COMPUTE — A custom MediaTek Genio carrier board that runs completely fanless, routing 5 camera streams directly to on-board storage.
⚡ THE I/O PIPELINE — We ditched USB for industrial GMSL. Thin, ultra-flexible coaxial lines route high-speed data down to the backpack and Power-over-Coax (PoC) back up, completely eliminating batteries on the wrist.
⏱️ UNIFIED SENSOR CLOCK — The MediaTek Genio SoC drives a shared master clock straight to the ISPs driving our 5 global shutter sensors, stamping metadata at the microsecond of capture to ensure zero-drift temporal alignment.
👁️ OPTIMIZED OPTICS — 95 DFOV lenses on flexible PCB ribbon modules keep the wrist cams flat to prevent snags. A 145-degree chest camera captures the macro workspace, while a 50mm baseline rectilinear stereo head pair preserves close-up 3D mapping.
To build hundreds of these, we took over a warehouse in Mountain View in April, called it the Instalab, and brought in talented Pros with assembly backgrounds from Tesla, Apple, and Meta.
To test the systems, we had our Pros wear active rigs while assembling more rigs. The video below shows the raw, time-synchronized Foxglove playback of that exact loop.
Now, we’re shipping these units globally to scale data collection for real Pros on the job.









