
Microsoft Research Unveils Arbor Autonomous Research Agent

Original post
DailyPapers@HuggingPapers

Microsoft Research introduces Arbor

A generalist autonomous research agent that uses persistent hypothesis-tree refinement to turn long-horizon exploration into cumulative learning. It beats Codex and Claude Code across 6 research tasks and hits 86% Any-Medal on MLE-Bench Lite.

6:23 AM · Jun 11, 2026 · 3.9K Views
Sentiment

Most commenters praise Arbor's hypothesis-tree refinement as methodical search that is superior to token prediction, while one user questions whether the results show real learning or just lucky retries.

Positive: 83.3%
Negative: 16.7%
4 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Views 3.1K · Bookmarks 10 · Likes 17 · Replies 2
AK@_akhaliq

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

1h · Views 3.1K · Likes 17 · Bookmarks 10
DailyPapers@HuggingPapers

Paper page: https://paperswithcode.co/paper/2606.11926

CLI, code & docs: https://github.com/RUC-NLPIR/Arbor

Project page: https://RUC-NLPIR.github.io/Arbor/

4h · Views 510 · Likes 3 · Bookmarks 3
AK@_akhaliq

paper: https://huggingface.co/papers/2606.11926

AK@_akhaliq

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

1h · Views 1.8K · Likes 3 · Bookmarks 2

@HuggingPapers i want the ugly trace more than the medal score. if Arbor only shows the winning branch, nobody can tell whether it learned research strategy or just got lucky with retries.

4h · Views 53
The AI Therapist@TheAIShrink

@HuggingPapers Persistent hypothesis-tree refinement is methodical search. token prediction never beats systematic exploration. that's Codex's real problem

2h · Views 18
Broadstreet@mattbroadstreet

@HuggingPapers @_akhaliq ⏫⏫

1h · Views 3