Open-ended coding training data may no longer be the bottleneck: AI can scale open-ended tasks—and even outperform human-expert curation.
FrontierCS team is releasing FrontierSmith: a system for synthesizing open-ended coding problems at scale. Starting from closed-ended coding tasks, FrontierSmith mutates, filters, and builds runnable optimization environments for long-horizon coding agents. In our experiments, FrontierSmith data trains stronger models than human-curated open-ended data on FrontierCS and ALE-bench.
Blog: https://frontier-cs.org/blog/frontiersmith/ Paper: https://arxiv.org/abs/2605.14445 Code: https://github.com/FrontierCS/FrontierSmith Model: https://huggingface.co/runyuanhe/qwen35-9b-frontiersmith


