16h agoNew visualization of unified neural scaling laws shows how parameters like model width, depth, and dataset size dynamically predict test errorIt maps error rates against one million training steps.