@RobertTLange Seems pretty similar to DGM? What do you think the key new elements are?
Self-Harness lets LLM agents autonomously improve their own harness, the layer that mediates their interaction with environments, without external human or model guidance:
1️⃣ Model-Specific Self-Improvement (SI) Loop: The same model executes tasks and iteratively improves its own harness via failure mining, targeted harness proposals, and regression-tested validation.
2️⃣ Targeted, Not Generic, Adaptations: MiniMax M2.5, Qwen3.5-35B-A3B, and GLM-5 learn distinct harness modifications tailored to their specific weaknesses.
3️⃣ Agents as Harness Co-Designers: A shift from human-engineered harnesses to agents that actively participate in reshaping their own operating environment.
📝: https://arxiv.org/abs/2606.09498
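
A rough sketch of how the loop in 1️⃣ might fit together (failure mining, then a targeted harness proposal, then regression-tested validation). This is not the paper's implementation: `Harness`, `self_improve`, `toy_run`, and `toy_propose` are hypothetical names made up for illustration, and the acceptance rule (keep a candidate only if it breaks nothing that already passed and fixes at least one failure) is an assumption. In a real system the same model would both run the tasks and write the proposal.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class Harness:
    """Hypothetical harness: just a list of textual rules (assumption)."""
    rules: List[str] = field(default_factory=list)


RunFn = Callable[[Harness, str], bool]               # True if the task is solved
ProposeFn = Callable[[Harness, List[str]], Harness]  # returns a candidate harness


def pass_set(harness: Harness, tasks: List[str], run: RunFn) -> Set[str]:
    """Run every task under the given harness and collect the ones that pass."""
    return {t for t in tasks if run(harness, t)}


def self_improve(harness: Harness, tasks: List[str], run: RunFn,
                 propose: ProposeFn, rounds: int = 3) -> Harness:
    passed = pass_set(harness, tasks, run)
    for _ in range(rounds):
        # 1) Failure mining: which tasks does the current harness fail?
        failures = [t for t in tasks if t not in passed]
        if not failures:
            break
        # 2) Targeted proposal: ask for a harness change aimed at those failures.
        candidate = propose(harness, failures)
        # 3) Regression-tested validation: accept only if nothing that
        #    previously passed now fails, and at least one new task passes.
        cand_passed = pass_set(candidate, tasks, run)
        if passed <= cand_passed and len(cand_passed) > len(passed):
            harness, passed = candidate, cand_passed
    return harness


# Toy stand-ins for the model (assumptions, for demonstration only).
TASKS = ["a", "b", "c"]


def toy_run(harness: Harness, task: str) -> bool:
    # Task "a" always passes; other tasks need a matching "fix:" rule.
    return task == "a" or f"fix:{task}" in harness.rules


def toy_propose(harness: Harness, failures: List[str]) -> Harness:
    # Propose one rule targeting the first observed failure.
    return Harness(rules=harness.rules + [f"fix:{failures[0]}"])


final = self_improve(Harness(), TASKS, toy_run, toy_propose)
print(final.rules)  # ['fix:b', 'fix:c']
```

Because each candidate is checked against the full task set before being accepted, a proposal that fixes one failure but breaks a previously passing task is discarded rather than merged.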