Kaito tests Anthropic's Opus 4.8 on a complete codebase refactor, consuming 100 million tokens without producing any working code
— The automated two-hour run generated 172,631 line additions.
——0——
Sentiment
Pos3.6%
Neg96.4%
Many users criticized Anthropic Opus 4.8 for failing a massive codebase refactor after consuming 100 million tokens, calling the outcome wasteful and ineffective.