From playing around with /goal
It feels like there's less and less of a need to build any type of workflow manually (whether through code, drag and drop, or a prompt). Instead, specify the goal, let the model intelligence figure out the underlying steps.
If the task is repeatable, then you can gather a dataset with ground-truth, and hillclimb it for increased cost / lower accuracy. To some extent this is what every non-frontier lab is optimizing for.
The world is moving from prompt engineering -> goal and eval engineering.

















