Top comment: @dawnsongtweets “ALE is built from real work, not synthetic tasks. Every task is derived from a real project that a human expert previously completed, and converted into a verifiable evaluation with objective grading. No vibes. No human judges. Fully reproducible. ALE spans 55 non-physical occupations, grounded in the O*NET / SOC 2018, the U.S. federal occupation taxonomy. Built with 300+ experts from 100+ institutions across science, engineering, medicine, law, finance, education, and many other fields.”