/Tech1h ago

OpenAI launches GeneBench-Pro to evaluate AI agents on computational biology, with top models scoring under 35%

OpenAI's gpt-5.6 sol (Pro) topped the benchmark at 31.5%.

2021.7K127248149.8K

#1939

Original post

OpenAI@OpenAI

We’re introducing GeneBench-Pro, a research-level benchmark for a harder kind of AI progress: how well agents can navigate messy biological data, choose the right analysis path, and make judgment calls that real computational research depends on. https://openai.com/index/introducing-genebench-pro/

10:10 AM · Jun 30, 2026 · 149K Views

Sentiment

Users expressed optimism about OpenAI's GeneBench-Pro benchmark for AI biological research agents because they believe AI will save humanity in the future.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Related links

Introducing GeneBench-Pro

OPENAI.COMVia

Posts from X

Most Activity

VIEWS751BOOKMARKS1LIKES6RETWEETS1REPLIES2

Vaibhav (VB) Srivastav@reach_vb

GeneBench-Pro is a benchmark for testing whether AI agents can do realistic computational biology work and not just answer biology questions

It gives models messy research-style problems: inspect data, catch bad samples, choose the right analysis, revise assumptions, and produce a defensible conclusion

OpenAI@OpenAI

1h75161

IGNIS@ignis_code

@reach_vb I believe that AI will save humanity in the future.

1h20

🧟@RaghavKoch19380

@reach_vb How do GPT Pro models differ from normal GPT models? Is it just fanning out N concurrent normal GPT models and aggregating their results?

1h10