GPT-5.5 Shows Limited Progress On Internal Research Debugging And OPQA Evals · Digg