What if instead of autoresearch reflecting incremental progress from a single person, it reflected *hundreds* of researchers’ progress, updated live, with every run’s data point being interactive and reproducible? You could survey the full space of explored ideas, see which changes actually move the benchmark, and fold the most promising ideas into your next run.
Labless (https://labless.dev) is my attempt at this. It can support thousands of simultaneous training runs across contributors around the world, each learning & improving together to hillclimb on a fixed benchmark, validated using a standardized codebase scaffold.
Importantly, all valid training runs auto-submit to Labless, so failed experiments are as visible and reproducible as the successes. We have an agentic API that lets you or your coding agent study every run submitted so far so you don't waste time investigating something someone already tried, and you can source novel ideas from other contributors.
Labless already hosts our Nanopath challenge to build the best pathology foundation model that trains in 1 hour on a single GPU. We are hoping to expand the platform further and support a suite of open-source research challenges.
If this resonates with you, if you think the path to research innovation lies in open-source collaborative hill-climbing, please reach out.



