/AI9h ago

Study Shows Single-Cell Foundation Models Plateau Early, Ignoring Scale

99624656.3K
Original postAnshul Kundaje#1650
Jorge Bravo Abad@bravo_abad

No scaling laws for single-cell foundation models: when bigger atlases stop teaching the model anything

In language and vision, the recipe has been simple: more data, bigger models, better performance. Single-cell biology borrowed that playbook. Foundation models for transcriptomics jumped from 1 million cells to atlases of over 100 million, on the assumption that scale would unlock the same gains. Alan DenAdel and coauthors put that assumption to the test, and the result is sobering.

Working from a 22.2-million-cell corpus, they pretrained 400 models across five architectures (from PCA and a variational autoencoder up to the Geneformer transformer) and ran 6,400 evaluation experiments. They varied not just dataset size (1% to 75%) but also diversity, using cell-type re-weighting and geometric sketching to deliberately enrich rare cell types and transcriptional states.

The finding: performance saturates almost immediately. On cell-type classification, batch integration, and perturbation prediction, most models hit their ceiling at roughly 1% of the corpus, about 200,000 cells. Beyond that, adding millions more cells changed essentially nothing. More diversity didn't help. Even spiking in genome-scale Perturb-seq data, to give the models perturbed phenotypes rather than just healthy ones, failed to move the needle. Larger models did score better overall, but they too plateaued early on data.

Two points stood out. Simple baselines (PCA, logistic regression) often matched or beat the transformers. And the strongest model, SCimilarity, won not because of size but because its contrastive training objective is aligned with the downstream task. For single-cell data, what you train on and how you frame the objective matters far more than how much you collect.

This reframes a quiet but expensive habit. In drug discovery, biotech, and any pipeline leaning on cell atlases, the instinct to keep scaling pretraining corpora may be burning compute for no return. The real leverage sits elsewhere: curating high-quality, task-relevant data and matching the training objective to the actual question you're trying to answer.

Paper: DenAdel et al., journal license | https://doi.org/10.1038/s41592-026-03120-y

6:59 AM · Jun 9, 2026 · 6.3K Views
Sentiment

Users call the finding that single-cell foundation models plateau around 200k cells one of the more important recent results in AI for biology.

Pos
100.0%
Neg
0.0%
1 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS55

This is one of the more important results in AI for biology lately.

The finding that performance saturates at ~200k cells (and that simple baselines can match or beat large transformers) suggests *single-cell data* has fundamentally different scaling properties than language or images. The structure is sparser, noisier, and more task-specific.

What stands out is that objective alignment and data quality/curations mattered far more than volume. In drug discovery and perturbation modelling, this feels like a signal to stop treating cell atlases as generic pretraining fuel and instead focus on building objectives that directly reflect the downstream biological questions.

The “bitter lesson” has limits. In some domains, better inductive biases and task framing beat brute-force scale.

9hViews 55
theorema@theorema_ai

@bravo_abad if the bitter lesson does not apply to cell biology, then the key unlock is not x$B and the field becomes a lot more open.

8hViews 39