/AI5h ago

DSPy Data Agent Hits 90% On DataAgentBench Stockmarket Benchmark

07251.2K
Original postShreya Shankar#409
paari_7@happened_7

Ran our DSPy+ RLM data agent on @UCBEPIC DataAgentBench's stockmarket set: 2754 tables, no schema dumping, $0.15 total. Hit 90% vs prev best 76% (PromptQL + Gemini 3.1 Pro). Running Gemma 4 31B via OpenRouter.

Memory compounds across queries, bakes into GEPA instructions agent gets smarter mid-run.

Full 12-dataset run tonight. Gunning for first open-source leaderboard entry 馃憖

2:08 PM 路 Jun 9, 2026 路 1.2K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
Posts from X
Most Activity
Most Activity
No ranked X posts are available for this story yet.