5h ago

Stanford researchers release 60-page arXiv paper 'Reflections and New Directions for Human-Centered Large Language Models' from Diyi Yang's CS329X class with contributions from more than 60 students

Lead authors include Caleb Ziems, Dora Zhao, and Diyi Yang.

0
Original post

The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

8:53 AM · May 20, 2026 View on X
Reposted by

Link here: https://arxiv.org/pdf/2605.06901

Diyi YangDiyi Yang@Diyi_Yang

The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

3:53 PM · May 20, 2026 · 10K Views
3:53 PM · May 20, 2026 · 761 Views

How do we center humans? We start with HCLLM DESIGN to understand (1) stakeholders, (2) design challenges, and (3) HCI solutions.

E.g., HCI interfaces can close the gap between user intentions and prompts -- a distinct AI design challenge called the "gulf of envisioning"

Diyi YangDiyi Yang@Diyi_Yang

Link here: https://arxiv.org/pdf/2605.06901

3:53 PM · May 20, 2026 · 761 Views
3:53 PM · May 20, 2026 · 439 Views

Next, we look at HCLLM DEVELOPMENT from an NLP perspective.

First: training data reflects the people, institutions, cultures, histories, and social contexts that produced it. From data origins, we can understand human-centered concerns around bias, privacy, and data ownership.

Diyi YangDiyi Yang@Diyi_Yang

How do we center humans? We start with HCLLM DESIGN to understand (1) stakeholders, (2) design challenges, and (3) HCI solutions. E.g., HCI interfaces can close the gap between user intentions and prompts -- a distinct AI design challenge called the "gulf of envisioning"

3:53 PM · May 20, 2026 · 439 Views
3:53 PM · May 20, 2026 · 184 Views

Second: in post-training, we highlight tensions and tradeoffs around preference tuning, personalization, & pluralism, tying concerns back to our earlier "data" discussion around bias & representation.

We also explore the limitations of scaling laws for human-centered objectives.

Diyi YangDiyi Yang@Diyi_Yang

Next, we look at HCLLM DEVELOPMENT from an NLP perspective. First: training data reflects the people, institutions, cultures, histories, and social contexts that produced it. From data origins, we can understand human-centered concerns around bias, privacy, and data ownership.

3:53 PM · May 20, 2026 · 184 Views
3:53 PM · May 20, 2026 · 158 Views

Third, we highlight pitfalls and best practices w/ evals at 3 different scopes: (1) model output, (2) human experience, (3) societal impact.

Currently, the field relies too much on surface heuristics and not enough on orienting HCLLMs towards long-term collective good objectives

Diyi YangDiyi Yang@Diyi_Yang

Second: in post-training, we highlight tensions and tradeoffs around preference tuning, personalization, & pluralism, tying concerns back to our earlier "data" discussion around bias & representation. We also explore the limitations of scaling laws for human-centered objectives.

3:53 PM · May 20, 2026 · 158 Views
3:53 PM · May 20, 2026 · 152 Views

After design and development, we look at responsible HCLLM DEPLOYMENT. There is tension between 3 ideals: (1) interpretability, (2) steerability, & (3) safety.

E.g, certain alignment and steering methods can make models less interpretable; more steerable models may be less safe.

Diyi YangDiyi Yang@Diyi_Yang

Third, we highlight pitfalls and best practices w/ evals at 3 different scopes: (1) model output, (2) human experience, (3) societal impact. Currently, the field relies too much on surface heuristics and not enough on orienting HCLLMs towards long-term collective good objectives

3:53 PM · May 20, 2026 · 152 Views
3:53 PM · May 20, 2026 · 128 Views

In summary, we show human-centeredness is more than an alignment objective. It is a design approach across the entire HCLLM pipeline, from development to deployment. There are no simple answers.

To make this concrete, we conclude w/ a case study on HCLLMs and the Future of Work.

Diyi YangDiyi Yang@Diyi_Yang

After design and development, we look at responsible HCLLM DEPLOYMENT. There is tension between 3 ideals: (1) interpretability, (2) steerability, & (3) safety. E.g, certain alignment and steering methods can make models less interpretable; more steerable models may be less safe.

3:53 PM · May 20, 2026 · 128 Views
3:53 PM · May 20, 2026 · 225 Views

This paper was jointly led by @cjziems & @dorazhao9, w/ contributions from over 60 authors across @stanfordnlp and @StanfordHAI, and cs329x! We build on lots of wisdom & empirical contributions of countless researchers across HCI, NLP, and Responsible AI.

Thank you to everyone involved ❤️

Diyi YangDiyi Yang@Diyi_Yang

In summary, we show human-centeredness is more than an alignment objective. It is a design approach across the entire HCLLM pipeline, from development to deployment. There are no simple answers. To make this concrete, we conclude w/ a case study on HCLLMs and the Future of Work.

3:53 PM · May 20, 2026 · 225 Views
3:53 PM · May 20, 2026 · 210 Views

Tired of AI automating research and solving major open problems? Take a read at Diyi’s new masterpiece to gain more confidence on humans!

Diyi YangDiyi Yang@Diyi_Yang

The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

3:53 PM · May 20, 2026 · 10K Views
8:47 PM · May 20, 2026 · 257 Views

@ChengleiSi lol

CLSCLS@ChengleiSi

Tired of AI automating research and solving major open problems? Take a read at Diyi’s new masterpiece to gain more confidence on humans!

8:47 PM · May 20, 2026 · 257 Views
8:54 PM · May 20, 2026 · 21 Views

This is like the avengers but for human AI interaction

Diyi YangDiyi Yang@Diyi_Yang

The next frontier of AI is not only more capable model; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

3:53 PM · May 20, 2026 · 10K Views
4:36 PM · May 20, 2026 · 3.7K Views
Stanford researchers release 60-page arXiv paper 'Reflections and New Directions for Human-Centered Large Language Models' from Diyi Yang's CS329X class with contributions from more than 60 students · Digg