/Tech3h ago

OpenAI Safety And Alignment Hosts Mixer At ICML With Hiring Teams

3141127910.2K

Original post

OpenAI safety + alignment will be hosting a mixer at ICML & there are several teams attending who are actively hiring. If you're interested in attending our safety & alignment event/meeting teams at ICML, please fill out the below form!

2:58 PM · Jun 16, 2026 · 8.8K Views

Sentiment

Users are excited about the OpenAI Safety And Alignment mixer at ICML because they view the hiring teams as cool and anticipate meeting them.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Related links

viewform

GOOGLE.COMVia

#1212

Posts from X

Most Activity

VIEWS1.3KBOOKMARKS11LIKES14REPLIES1

Jasmine Wang@j_asminewang

https://bit.ly/icml-safety <- fill out this form!

Jasmine Wang@j_asminewang

3h1.3K1411

Jasmine Wang@j_asminewang

And at the training level, Alignment Training studies how durable behaviors emerge across pre-, mid-, and post-training. The team builds data, objectives, and evals to help models follow intent, reason reliably, express uncertainty, and act honestly in new situations.

3h30

Jasmine Wang@j_asminewang

I sadly won't personally be in Seoul but excited for y'all to meet these cool teams!

3h1702

Jasmine Wang@j_asminewang

Teams attending include: 1) Preparedness, which tracks frontier risks and works on mitigations 2) Pretraining Safety, which builds safer base models 3) Honesty & Reliability, which tackles hallucination, uncertainty, and deception

3h13

Jasmine Wang@j_asminewang

4) Robustness team, focused on making models more resilient to adversarial pressure 5) Trustworthy AI, focused on collective alignment and safety work grounded in societal impact, and third-party assurances

3h8

Jasmine Wang@j_asminewang

A related question is whether we can understand what models are doing as they reason. The CoT Monitorability team is developing ways to tell when CoTs are faithful and legible, what improves or degrades that signal, and whether reasoning can reveal latent behavior.

3h8

Jasmine Wang@j_asminewang

Several teams attending are also focused on how we scale alignment as models become more capable. Alignment Scaling works on synthetic data, reasoning-based reward models, and automated red teaming, with the longer-term goal of training models to help with alignment research.

3h7