ARC and @aicrowdHQ are launching a ≥$100k contest for white-box estimation algorithms: given the weights of an MLP, the goal is to estimate the expected output of the network on Gaussian inputs. (Thread)
Alignment Research Center's Jacob Hilton launches a $100,000 challenge to estimate MLP outputs directly from network weights
A warm-up round for the competition is currently live.
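To make the task concrete, here is a minimal sketch of the naive sampling baseline that a white-box estimator would presumably need to beat: draw Gaussian inputs, run the network, and average the outputs. The architecture details (ReLU activations, no biases, standard normal inputs) and the names `mlp_forward` and `monte_carlo_mean` are assumptions for illustration only, not the contest's actual specification or scoring setup.

```python
import random


def relu(x):
    return x if x > 0.0 else 0.0


def mlp_forward(weights, x):
    """Apply each weight matrix in turn, with ReLU between layers.

    `weights` is a list of matrices; each matrix is a list of rows, and
    each row holds one output unit's input weights. No biases (an assumption).
    """
    h = x
    for i, W in enumerate(weights):
        h = [sum(w * v for w, v in zip(row, h)) for row in W]
        if i < len(weights) - 1:  # no activation after the final layer
            h = [relu(v) for v in h]
    return h


def monte_carlo_mean(weights, n_samples=10000, seed=0):
    """Estimate E[f(x)] for x ~ N(0, I) by plain sampling (black-box baseline)."""
    rng = random.Random(seed)
    in_dim = len(weights[0][0])
    out_dim = len(weights[-1])
    total = [0.0] * out_dim
    for _ in range(n_samples):
        x = [rng.gauss(0.0, 1.0) for _ in range(in_dim)]
        y = mlp_forward(weights, x)
        total = [t + v for t, v in zip(total, y)]
    return [t / n_samples for t in total]


# Toy check: a 1-1-1 network computes relu(x), whose exact mean is 1/sqrt(2*pi).
toy_weights = [[[1.0]], [[1.0]]]
estimate = monte_carlo_mean(toy_weights, n_samples=20000)
```

For the toy network the exact answer is known in closed form, since E[ReLU(x)] = 1/sqrt(2π) for a standard normal x, so the sampled estimate can be checked against it. A white-box method would instead try to derive such quantities from the weights directly, without drawing samples.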
It would be nice if AI companies and others (e.g. startups) tried to have their AIs hillclimb on this task.
ARC is approximately our only current bet on scalable/worst-case solutions to alignment, and it could be boosted by relatively checkable work!