DeepSeek R1 models multiply numbers with up to 100 digits without external tools by applying scaled chain-of-thought reasoning on the 671B variant
GPT-4 reached only 4% accuracy on four-digit pairs in 2023 tests.
——0——
@teortaxesTex -rw-r--r-- 1 89Tb Jan 13 tons_of_multiplications_train_set.jsonl
Modern LLMs can do multiplication of 100-digit numbers without tools. So much for "embers of autoregression". Just scale the COT bro
2:12 PM · May 22, 2026 · 8.9K Views
7:13 PM · May 22, 2026 · 384 Views