/Tech10h ago

Baichuan Releases M4 Clinical Medical Agent Scoring 55.1 on HealthBench

6260309.7K

#385

Original post

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr#385inTech

Baichuan-M4: A Clinical-Grade Medical Agent System for Continuous Care

Achieves 55.1 on HealthBench Professional, beating GPT 5.5

Context: Baichuan is one of the prominent AI startups/labs in China, mostly focusing on AI in healthcare. They've previously released Baichuan-M1 through M3, along with technical reports.

They have now released a technical report for Baichuan-M4, although it is not open-source :(

Baichuan-M4 is designed as a clinical-grade medical agent system, supporting patient consultation, follow-up, continuous care, evidence-based retrieval, medical image understanding, long-term patient memory, and multi-agent coordination in controlled environments.

RL training: "SPAR++ replaces coarse-grained scoring of an entire dialogue trajectory with reward signals anchored to key clinical spans. The model is not only rewarded for reaching the correct final conclusion, but also for sufficient history taking, timely risk identification, and appropriate tool use."

"In mixed initial-visit and follow-up scenarios, M4 uses a curriculum learning strategy [9] of “building the foundation with initial visits first, then improving performance with follow-ups."

Baichuan-M4 is trained with tools for dynamic memory management, retrieval of authoritative medical evidence, and multimodal perception (OCR+X-ray+dermatology).

4:13 AM · Jun 9, 2026 · 5.6K Views

/Tech10h ago

Baichuan Releases M4 Clinical Medical Agent Scoring 55.1 on HealthBench

6260309.7K

#385

Original post

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr#385inTech

Baichuan-M4: A Clinical-Grade Medical Agent System for Continuous Care

Achieves 55.1 on HealthBench Professional, beating GPT 5.5

Context: Baichuan is one of the prominent AI startups/labs in China, mostly focusing on AI in healthcare. They've previously released Baichuan-M1 through M3, along with technical reports.

They have now released a technical report for Baichuan-M4, although it is not open-source :(

"In mixed initial-visit and follow-up scenarios, M4 uses a curriculum learning strategy [9] of “building the foundation with initial visits first, then improving performance with follow-ups."

Baichuan-M4 is trained with tools for dynamic memory management, retrieval of authoritative medical evidence, and multimodal perception (OCR+X-ray+dermatology).

4:13 AM · Jun 9, 2026 · 5.6K Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Posts from X

Most Activity

VIEWS4.1KBOOKMARKS5LIKES2

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr

abs: https://arxiv.org/abs/2606.08982

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr

Baichuan-M4: A Clinical-Grade Medical Agent System for Continuous Care

Achieves 55.1 on HealthBench Professional, beating GPT 5.5

Context: Baichuan is one of the prominent AI startups/labs in China, mostly focusing on AI in healthcare. They've previously released Baichuan-M1 through M3, along with technical reports.

They have now released a technical report for Baichuan-M4, although it is not open-source :(

"In mixed initial-visit and follow-up scenarios, M4 uses a curriculum learning strategy [9] of “building the foundation with initial visits first, then improving performance with follow-ups."

Baichuan-M4 is trained with tools for dynamic memory management, retrieval of authoritative medical evidence, and multimodal perception (OCR+X-ray+dermatology).

10h4.1K25