Users doubted AI's ability to generate advanced problems for the FrontierMath benchmark because its suggestions were typically bland, boring, and less novel than those of human experts.
Researcher Details Challenges Using AI to Generate FrontierMath Problems · Digg