8h ago

Stanford Law Releases JudgmentBench Dataset To Evaluate LLM Legal Work

0
Original post

Announcing JudgmentBench – a dataset we at @StanfordLaw liftlab developed along with @harvey and @SnorkelAI that evaluates frontier LLM work product. The dataset contains 30 real-world tasks crafted by Biglaw attorneys paired with >3000 rubric and preference expert annotations.

11:12 AM · May 27, 2026 View on X