Pooled CRISPR screening with single-cell transcriptome readout. Nature Methods, 14(3):297–301
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: q-bio.CB (1)
Years: 2026 (1)
Verdicts: CONDITIONAL (1)
Representative citing papers:
Benchmarking virtual cell models for in-the-wild perturbation response
A new benchmarking framework shows virtual cell models overestimate performance on standard tests, drop sharply on unseen contexts and perturbations, and produce inconsistent rankings across metrics.