Update: also tried the same prompt with Fable 5
First impression: visually the strongest result so far --- denser village, better terrain/building placement, and nice lived-in details.
Still not perfect though: close-ups show both good details and failure cases, including floating buildings π (check out the threadsπ)
We gave 4 frontier coding agents the same hard environment-gen test:
Preserve the desert landscape. Surgically remove only the ruins. Then build a dense, believable Middle Eastern village from scratch using rustic assets.
Which model did the best job? ποΈπ
Poll and full prompt in thread.

