rb's User Avatar

@rb

in /cursor 1 month ago

Cursor is experimenting with agents running autonomously for days and even weeks

Scaling long-running autonomous coding · Cursor - Featured Image

Scaling long-running autonomous coding · Cursor

cursor.com - faviconcursor.com
TLDR

This blog post explores Cursor's experiments with scaling autonomous coding agents. They've learned that coordinating hundreds of agents on a single project requires a balance of structure and flexibility. Initially, they tried dynamic coordination and locking mechanisms, which failed due to bottlenecks and brittleness. They then separated roles into planners, workers, and judges, which improved coordination and scalability. They tested this system by having agents build a web browser from scratch, migrate a codebase, and improve a product. They found that model choice and prompts matter more than system complexity. While multi-agent coordination remains challenging, they've made progress by scaling agents to tackle ambitious projects.

2Score: 2

0 Comments