Specialized Medical AI Matches Frontier Model Using Clinician Feedback

"You don’t need frontier scale to reach frontier quality" in specialized domains; you need the right expert feedback loop.

Heidi says it matched Sonnet 4.6 in clinical search using a much smaller model trained on clinician preferences rather than relying on raw scale.

Heidi Evidence is a clinical search tool where doctors ask medical questions and get sourced answers.

In a blind comparison, clinicians were shown the same medical question with two anonymized answers, one from Heidi’s smaller model and one from Sonnet 4.6, and they picked Heidi’s answer 49.9% of the time, which is effectively an even split.
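
To judge whether a win rate near 50% in a blind pairwise test is consistent with parity, one option is to put a confidence interval around it. The sketch below uses a Wilson score interval, which is a standard choice for proportions but not something the post says Heidi used. The counts are hypothetical: the post reports only the 49.9% rate, not how many comparisons were made.

```python
import math


def wilson_interval(wins: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = wins / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half, center + half


# Hypothetical counts chosen to give 49.9%; the real sample size is not stated.
wins, total = 499, 1000
low, high = wilson_interval(wins, total)

# Parity is plausible if the interval contains 0.5.
includes_parity = low <= 0.5 <= high
print(f"win rate {wins / total:.3f}, interval ({low:.3f}, {high:.3f}), "
      f"includes 0.5: {includes_parity}")
```

The width of the interval depends heavily on the number of comparisons, so the same 49.9% figure supports a much stronger parity claim with thousands of judgments than with a few dozen.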

In medicine specifically, the hard problem is knowing when to search, what to cite, how much to say, and when a vague answer is worse than no answer.

Tom Kelly@TomkeyKong

There’s been debate in the last couple days about whether general models beat specialized medical AI. It's the wrong question. This is an argument about how to measure.

You don't need frontier scale to reach frontier quality. Six weeks ago we matched the best frontier model in Heidi Evidence with a model of our own, a fraction of the size.

Here's how. 🧵

8:48 AM · Jun 15, 2026 · 2.1K Views

Posts from X

864 Views · 3 Likes · 1 Retweet · 1 Reply

Rohan Paul@rohanpaul_ai

Read their detailed blog here.

https://www.heidihealth.com/blog/clinical-ai-model-fine-tuning