Increasingly, I am also leveraging other models like Deepseek, Qwen, and Minimax for the evaluator agent in my /goal automoously loops.
Same here. Happy with Opus 4.8 (planning) and GPT-5.5 (execution).
Also, breaking steps into smaller ones for increasing quality is so underrated. This is why dynamic workflows are a bigger deal than most people think.
