1d ago

Academic Lianhui Qin argues a SimWorld 3D city test shows frontier coding agents still struggle with spatial reasoning

The Gemini output placed a giant building in a street.

Original post

We asked 4 frontier coding agents to build the same Unreal 3D city scene in SimWorld Studio. Same prompt. Different worlds 👀

Claude Code + Opus 4.7
Codex + GPT-5.5
Cursor + Composer 2.5
OpenCode + Gemini 2.5 Pro

Who wins?

12:38 PM · May 28, 2026

I’m not sure Gemini 3 looks that much more impressive here.🤔

For example, why is there a giant White House–like building just sitting in the middle of the street?

This feels like a real example of how even frontier coding agents can still struggle with spatial reasoning.

SimWorld @simworld_ai

Missed Gemini 3 yesterday, but catching up now
This genuinely looks impressive!

5:50 PM · May 29, 2026 · 1.7K Views
6:50 PM · May 29, 2026 · 771 Views

I’m not sure Gemini looks that much more impressive here.🤔

For example, why is there a giant White House–like building just sitting in the middle of the street?

This feels like a real example of how even frontier coding agents can still struggle with spatial reasoning.

6:27 PM · May 29, 2026 · 81 Views