Epoch AI reports Claude strength in software engineering
Epoch AI Research aggregated benchmark results across frontier AI systems into domain-specific Effective Compute Indices. The analysis found the Claude family scoring 2.7 points higher on software engineering tasks than its overall ECI while scoring 1.8 points lower on math tasks. Claude models appeared stronger than competitors at software engineering and weaker at mathematics when normalized for general capability. A scatter plot illustrated the pattern across multiple Claude versions.
it's a chess benchmark bro
leela and stockfish have like 100M param networks to get to a 3600 rating