Gemini 3.5 Flash matches Sonnet-class models on agentic tasks but costs 7.46 times more than GPT-5.5-xhigh at $22.96 per run on PencilPuzzleBench.
Excessive verbosity drives context overflows and higher expenses.
QUOTE POST
#980 Lisan al Gaib @SCALING01
what the actual fuck
Gemini 3.5 Flash is 7.46 times more EXPENSIVE than GPT-5.5-xhigh on PencilPuzzleBench
(direct ask scores are below gpt-5.2-high)

7:11 PM · May 20, 2026 · 21.1K Views
the agentic score by itself is fine
but the cost is not real
7:17 PM · May 20, 2026 · 1.6K Views