/AI5h ago

Rhaister Model Predicts Drug Phenotypes With Experimental Assay Accuracy

21182447917.4K
Original postAnshul Kundaje#1650
Nima Alidoust@nalidoust

Today we release Rhaister, an elegant statistical model that predicts drug phenotypes in new contexts w/ accuracies comparable to experimental assays.

And dropping Emerald Bay, a 2M cell dataset measuring long time-course phenotypes across 1000s of drug-cell line interactions.

10:18 AM · Jun 9, 2026 · 13K Views
Sentiment

Many users celebrated the Rhaister model's ability to predict drug phenotypes at experimental assay accuracy, praising it as a huge advancement and incredible work that enables large-scale experiments.

Pos
100.0%
Neg
0.0%
7 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS1.5KREPLIES1
Anshul Kundaje@anshulkundaje

Good to see folks going back to the basics. Been saying this for a while, that the whole AI virtual cell space has been putting parameters first & expt design second (or not at all) when the latter is the most important to match your questions of interest. 1/

Nima Alidoust@nalidoust

Today we release Rhaister, an elegant statistical model that predicts drug phenotypes in new contexts w/ accuracies comparable to experimental assays.

And dropping Emerald Bay, a 2M cell dataset measuring long time-course phenotypes across 1000s of drug-cell line interactions.

36mViews 1.5KLikes 13Bookmarks 4
BOOKMARKS5
Nima Alidoust@nalidoust

Unlike “virtual cell models," Rhaister goes back to the basics: it builds a minimalist perturbation model from the ground up, directly on summary statistics of the data.

Nima Alidoust@nalidoust

Today we release Rhaister, an elegant statistical model that predicts drug phenotypes in new contexts w/ accuracies comparable to experimental assays.

And dropping Emerald Bay, a 2M cell dataset measuring long time-course phenotypes across 1000s of drug-cell line interactions.

5hViews 981Likes 21Bookmarks 5
LIKES23RETWEETS8
Hani Goodarzi@genophoria

@tahoe_ai team doing what they do best! A key takeaway from VCC’25 for us was that in the current data regimes, elegant statistical models match if not surpass far more complex transformer-based architectures! “Going back to the basics” is such a great title for this… to say nothing of the 2M cell dataset! Kudos…

Nima Alidoust@nalidoust

Today we release Rhaister, an elegant statistical model that predicts drug phenotypes in new contexts w/ accuracies comparable to experimental assays.

And dropping Emerald Bay, a 2M cell dataset measuring long time-course phenotypes across 1000s of drug-cell line interactions.

5hViews 1.1KLikes 23Bookmarks 1
Nima Alidoust@nalidoust

A more in-depth tweetorial from @vallens

5hViews 907Likes 4Bookmarks 4
Nima Alidoust@nalidoust

It has been a wonderful surprise for us to see such an interpretable, inexpensive model (trained in seconds, predictions in milliseconds), accomplish what virtual cell models (typically with far more complex architectures) promised to eventually do.

5hViews 1.4KLikes 5Bookmarks 1
Nima Alidoust@nalidoust

These results demonstrate we can accomplish a lot by going back to basics and building models that, by design, reflect the statistics of the underlying data. Rhaister shows that scaling the right data is far more valuable than scaling parameters.

5hViews 434Likes 7Bookmarks 0
Anshul Kundaje@anshulkundaje

Also will once again iterate that one can develop simple approaches that can predict perturbations that are in distribution very well without ever learning a causal model of gene regulation. 2/

Anshul Kundaje@anshulkundaje

Good to see folks going back to the basics. Been saying this for a while, that the whole AI virtual cell space has been putting parameters first & expt design second (or not at all) when the latter is the most important to match your questions of interest. 1/

36mViews 346Likes 4Bookmarks 1
Nima Alidoust@nalidoust

And it is capable of predicting more complex drug phenotypes such as sensitivity in cancer cells far beyond simple baselines. And there is room to make it much better, so stay tuned.

5hViews 205Likes 2Bookmarks 1
Nima Alidoust@nalidoust

With a handful of example perturbations in a new context, Rhaister predicts responses for other perturbations with accuracies within experimental noise, exceeding state of the art virtual cell models in performance.

5hViews 259Likes 4
Shreshth Gandhi@shreshth_gandhi

@nalidoust So excited for this to be finally out! Incredible work led by @vallens . Rhaister allowed us to do launch hundreds of dataset scaling experiments in hours in what would have taken months with the previous generation of perturbation models 🚀🚀

5hViews 278Likes 4
Nima Alidoust@nalidoust

It is the first model we have seen that performs significantly beyond mean baselines in a zero-shot setting; a task commonly proposed as a promise of virtual cell models.

5hViews 247Likes 3
Nima Alidoust@nalidoust

Despite its simplicity, it is the first model that shows consistent scaling with more perturbation data.

5hViews 217Likes 3
Nima Alidoust@nalidoust

Read more Paper: https://tahoebio-assets.com/rhaister-manuscript.pdf Model and datasets: https://huggingface.co/collections/tahoebio/rhaister

5hViews 285Likes 2
Nima Alidoust@nalidoust

What excites us more is what comes next: fast and interpretable, Rhaister is far better suited to advance biological reasoning in close iteration with the emerging agentic workflows.

5hViews 248Likes 2
Nima Alidoust@nalidoust

We show that by testing it against our very unique Emerald Bay dataset, generated using our Mosaic platform, measuring 5-day sensitivity of cancer models to various drugs. And we are open-sourcing that dataset along with Rhaister.

5hViews 209Likes 2

@nalidoust Here's to another moon landing 🚀💙

5hViews 412Likes 2
Purav Gupta@PuravGupta5

@nalidoust Huge advancement for the field 🔥💯

4hViews 63Likes 2
Pedram Bayat@Bayat_Pedram

@nalidoust Let’s goooo! 🚀🚀

5hViews 98Likes 1
minjae@minjaekwon_

@nalidoust 🚀🚀🚀

5hViews 83
Load more posts