Gemini 3.5 Flash Leads RuneScape-Bench Per-Skill AI Evaluation

Original post
Lisan al Gaib (@scaling01) · #980
Gemini 3.5 Flash performs exceptionally well on RuneScape-Bench
3:09 AM · May 21, 2026

Reply
Lisan al Gaib (@scaling01) · #980
maxbittker.github.io
runescape-bench: AI Agent Benchmark for RuneScape
runescape-bench evaluates AI coding agents on their ability to play RuneScape.

Lisan al Gaib (@scaling01)
Gemini 3.5 Flash performs exceptionally well on RuneScape-Bench
10:09 AM · May 21, 2026 · 1.3K Views
10:10 AM · May 21, 2026 · 527 Views