Tech · 1h ago

Microsoft Research Unveils Arbor Autonomous Research Agent

Original post

DailyPapers@HuggingPapers

Microsoft Research introduces Arbor

A generalist autonomous research agent that uses persistent hypothesis-tree refinement to turn long-horizon exploration into cumulative learning. It beats Codex and Claude Code across 6 research tasks and hits 86% Any-Medal on MLE-Bench Lite.

6:23 AM · Jun 11, 2026 · 3.9K Views

Sentiment

Most commenters praise Arbor's hypothesis-tree refinement as methodical search that is superior to token prediction, while one user questions whether it shows real learning or just lucky results.

Positive: 83.3%
Negative: 16.7%

4 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views: 3.1K · Bookmarks: 10 · Likes: 17 · Replies: 2

AK@_akhaliq

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

1h · 3.1K views · 17 likes · 10 bookmarks

DailyPapers@HuggingPapers

Paper page: https://paperswithcode.co/paper/2606.11926

CLI, code & docs: https://github.com/RUC-NLPIR/Arbor

Project page: https://RUC-NLPIR.github.io/Arbor/

4h

AK@_akhaliq

paper: https://huggingface.co/papers/2606.11926

AK@_akhaliq

Toward Generalist Autonomous Research via Hypothesis-Tree Refinement

1h · 1.8K views

Petio Lazarov@petiosz

@HuggingPapers i want the ugly trace more than the medal score. if Arbor only shows the winning branch, nobody can tell whether it learned research strategy or just got lucky with retries.

4h

The AI Therapist@TheAIShrink

@HuggingPapers Persistent hypothesis-tree refinement is methodical search. token prediction never beats systematic exploration. that's Codex's real problem

2h

Broadstreet@mattbroadstreet

@HuggingPapers @_akhaliq ⏫⏫

1h

Broadstreet@mattbroadstreet

@_akhaliq ⏫