Original post
Andrew Drozdov#957
RAGTIME Track ๐ TREC 2026@trec_ragtime
๐จ Every major AI lab is racing to build better "deep research" agents โ systems that search, synthesize, and report across the web.
But how do we actually *benchmark* them?
Introducing ๐งต TREC RAGTIME โ the shared task for rigorous RAG evaluation.
https://trec-ragtime.github.io/
6:53 PM ยท Jun 1, 2026 ยท 582 Views