pith. machine review for the scientific record.

arxiv: 2605.08128 · v1 · submitted 2026-05-01 · 💻 cs.LG · cs.AI

Recognition: 2 theorem links


Towards Universal Gene Regulatory Network Inference: Unlocking Generalizable Regulatory Knowledge in Single-cell Foundation Models

Authors on Pith: no claims yet

Pith reviewed 2026-05-12 01:12 UTC · model grok-4.3

classification 💻 cs.LG cs.AI
keywords gene regulatory networks · single-cell foundation models · GRN inference · virtual value perturbation · gradient trajectory · zero-shot generalization · regulatory signal distillation

The pith

Single-cell foundation models contain extractable regulatory knowledge that enables accurate gene network inference on unseen genes and datasets.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Gene regulatory networks describe how genes control each other in cells, and inferring them from single-cell data is key to understanding biology. Single-cell foundation models, which are large AI systems trained on vast transcriptomic data, were expected to improve this inference but have not performed well because their training does not explicitly learn regulatory patterns. The authors address this by creating a benchmark that tests how well models predict regulations for genes and datasets they have not seen before. They then develop Virtual Value Perturbation, which involves changing gene expression values in the model, and Gradient Trajectory, which follows how predictions change along input gradients, to pull out generalizable regulatory features from the models. Tests show these techniques lead to better predictions than previous approaches, pointing to a new way to use foundation models for this task.
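The abstract gives no formulas for either procedure, but the general shape of perturbation- and gradient-based edge scoring can be sketched against a toy stand-in model. Everything below (the toy model, the planted edge, the scoring functions) is illustrative, not the paper's implementation:

```python
import math

def toy_scfm(expr):
    """Illustrative stand-in for a frozen single-cell foundation model:
    maps an expression vector to per-gene outputs. By construction,
    gene 1's output depends strongly on gene 0 (a planted edge 0 -> 1)
    and only weakly on background gene 2."""
    return [expr[0], math.tanh(2.0 * expr[0]) + 0.01 * expr[2], expr[2]]

def perturbation_score(model, expr, src, tgt, delta=1.0):
    """Virtual-value-perturbation-style score (sketch): bump the source
    gene's expression value and measure how far the model's output for
    the target gene moves. A large response suggests src regulates tgt."""
    base = model(expr)[tgt]
    bumped = list(expr)
    bumped[src] += delta
    return abs(model(bumped)[tgt] - base)

def gradient_score(model, expr, src, tgt, eps=1e-4):
    """Gradient-style score (sketch): a finite-difference stand-in for
    reading off the sensitivity of the target output to the source input."""
    lo, hi = list(expr), list(expr)
    lo[src] -= eps
    hi[src] += eps
    return abs(model(hi)[tgt] - model(lo)[tgt]) / (2 * eps)

expr = [0.5, 0.0, 0.3]
planted = perturbation_score(toy_scfm, expr, src=0, tgt=1)     # large response
background = perturbation_score(toy_scfm, expr, src=2, tgt=1)  # small response
```

On this toy model both scores rank the planted edge above the background pair; the paper's contribution is extracting analogous responses from a real frozen scFM so that they transfer to unseen genes.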

Core claim

The paper claims that Virtual Value Perturbation and Gradient Trajectory can distill implicit regulatory signals from single-cell foundation models into inter-gene features that support generalizable GRN inference, as demonstrated by superior performance on a new benchmark for predictions on unseen genes and datasets.

What carries the argument

Virtual Value Perturbation and Gradient Trajectory: two methods that perturb gene expression values in, or trace gradient paths through, the scFM to derive generalizable inter-gene regulatory features.

If this is right

  • The approach allows GRN inference without retraining for new genes or cell types.
  • It outperforms standard methods on the generalization benchmark.
  • It provides a framework for extracting biological knowledge from foundation models beyond their original training objectives.
  • Traditional reconstruction-based pre-training in scFMs is insufficient for regulatory tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the methods work, similar distillation could improve other inferences like cell type classification or drug response prediction from the same models.
  • The benchmark could be extended to test generalization across species or disease states.
  • Success here suggests that foundation models encode more biological structure than their training losses directly reveal.
  • Practically, this might reduce the need for large labeled GRN datasets by leveraging unlabeled single-cell data through models.

Load-bearing premise

The virtual perturbations and gradient trajectories isolate genuine regulatory relationships encoded in the foundation model rather than noise or training artifacts.

What would settle it

An experiment that replaces the foundation model with random embeddings, or uses known non-regulatory gene pairs as controls, would settle it: if the performance gains persist under these controls, the methods are capturing artifacts rather than true regulatory signals; if the gains vanish, the signal genuinely comes from the scFM.
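The random-embedding control described above can be operationalized as follows. This is a sketch under stated assumptions: the dot-product edge score, the gene count, and the planted edge are all hypothetical, standing in for whatever inter-gene feature the paper actually extracts:

```python
import random

def edge_score(emb, i, j):
    """Pairwise feature score between genes i and j; a plain dot product
    stands in for whatever inter-gene feature a method extracts."""
    return sum(a * b for a, b in zip(emb[i], emb[j]))

def mean_true_edge_rank(emb, n_genes, true_edges):
    """Rank all ordered gene pairs by score (best first) and return the
    mean rank of the true edges; chance level is roughly half the number
    of candidate pairs."""
    pairs = [(i, j) for i in range(n_genes) for j in range(n_genes) if i != j]
    ranked = sorted(pairs, key=lambda p: -edge_score(emb, *p))
    pos = {p: r for r, p in enumerate(ranked)}
    return sum(pos[e] for e in true_edges) / len(true_edges)

# Embedding where regulator 0 and its target 1 share a direction
# (a planted, purely illustrative regulatory relationship).
structured = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
true_edges = [(0, 1)]

# Control: replace the embedding with random vectors of the same shape.
rng = random.Random(0)
random_emb = [[rng.gauss(0, 1) for _ in range(2)] for _ in range(4)]

r_struct = mean_true_edge_rank(structured, 4, true_edges)
r_random = mean_true_edge_rank(random_emb, 4, true_edges)
```

If the distilled scFM features behave like `structured` (true edges near the top) while the random control hovers at chance, the gains are attributable to the model rather than to the extraction pipeline.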

Figures

Figures reproduced from arXiv: 2605.08128 by Hang Li, Jianqiang Huang, Jiaxin Qi, Yan Cui, Yuhua Zheng.

Figure 1
Figure 1: (a) Traditional GRN inference operates in a closed-world setting, where an optimized fθ struggles with dimension mismatches on unseen genes from heterogeneous datasets. (b) Our UGRN setting utilizes frozen scFMs for universal feature extraction, enabling the generalization of regulatory predictions by "translator" fϕ to open-world scenarios involving unseen genes and datasets.
Figure 2
Figure 2: Illustration of methods for extracting eij from scFM M. The figure compares baselines (a) Perturbation and (b) Embedding against our proposed (c) Virtual Value Perturbation and (d) Gradient Trajectory. Green and blue blocks highlight the entries corresponding to source gene gi and target gene gj, respectively, while grey blocks represent background genes. Note that (a) utilizes real mean expression x̄, wh…
read the original abstract

Gene Regulatory Network (GRN) inference is essential for understanding complex cellular mechanisms, rendered tractable through single-cell transcriptomic data. With the emergence of single-cell Foundation Models (scFMs), enhanced transcriptomic encoding is widely expected to revolutionize GRN inference. However, we observe that their performance remains far from satisfactory. The primary reason is that the standard reconstruction-based pre-training objectives often fail to explicitly capture latent regulatory signals. To bridge this gap, we first introduce a GRN generalization benchmark designed to evaluate regulatory predictions on unseen genes and datasets, which relies on the zero-shot capabilities of scFMs and is inherently challenging for traditional methods. Furthermore, to unlock the regulatory knowledge within the foundation models, we propose two novel methods, Virtual Value Perturbation and Gradient Trajectory, to distill implicit regulatory information from scFMs into highly generalizable inter-gene features. Extensive experiments demonstrate that our approach significantly outperforms existing methods, establishing a new paradigm for leveraging the potential of scFMs in universal GRN inference.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces a GRN generalization benchmark for zero-shot evaluation of regulatory predictions on unseen genes and datasets using single-cell foundation models (scFMs). It argues that standard reconstruction-based pre-training fails to capture latent regulatory signals and proposes two distillation methods—Virtual Value Perturbation and Gradient Trajectory—to extract generalizable inter-gene features from frozen scFMs. Extensive experiments are claimed to show significant outperformance over existing methods, establishing a new paradigm for universal GRN inference.

Significance. If the central claims hold after addressing validation gaps, the work would provide a concrete mechanism for unlocking implicit regulatory knowledge in scFMs, moving beyond co-expression or reconstruction objectives. The introduction of a dedicated zero-shot generalization benchmark is a clear strength that could become a standard evaluation tool. However, without evidence that the proposed methods isolate causal regulatory structure rather than embedding artifacts, the significance remains provisional.

major comments (2)
  1. [Methods (Virtual Value Perturbation and Gradient Trajectory subsections)] The central claim that Virtual Value Perturbation and Gradient Trajectory distill causal regulatory signals (rather than co-expression statistics, batch effects, or model-specific biases) is load-bearing for the abstract's assertion of a 'new paradigm.' No negative controls are described, such as edge permutation while preserving marginals or comparison against random-walk features on the same embedding space, leaving open the possibility that outperformance reflects better exploitation of pre-training correlations.
  2. [Experiments] The generalization benchmark is presented as inherently challenging for traditional methods, yet no ablations, statistical significance tests, numbers of independent runs, or ground-truth edge-validation details are provided to support the 'significantly outperforms' claim. This directly undermines assessment of whether the distilled features generalize on true regulatory structure.
minor comments (2)
  1. [Abstract] The abstract states 'extensive experiments' but supplies no metrics, baselines, or controls; the full manuscript should ensure these are explicitly tabulated with effect sizes.
  2. [Preliminaries] Notation for inter-gene features extracted by the two methods should be defined consistently with the benchmark evaluation protocol to avoid ambiguity in zero-shot transfer.
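The degree-preserving negative control requested in major comment 1 is a standard network-randomization technique (double-edge swaps); the sketch below is generic and not taken from the paper:

```python
import random
from collections import Counter

def degree_preserving_shuffle(edges, n_swaps, seed=0):
    """Null model via double-edge swaps: repeatedly exchange the targets
    of two randomly chosen edges. Every gene keeps its in-degree and
    out-degree, but the specific regulator -> target pairings are
    destroyed, giving a chance-level reference network for scoring."""
    rng = random.Random(seed)
    edges = list(edges)
    edge_set = set(edges)
    for _ in range(n_swaps):
        (a, b), (c, d) = rng.sample(edges, 2)
        new1, new2 = (a, d), (c, b)
        # Skip swaps that would create self-loops or duplicate edges.
        if a == d or c == b or new1 in edge_set or new2 in edge_set:
            continue
        edge_set -= {(a, b), (c, d)}
        edge_set |= {new1, new2}
        edges[edges.index((a, b))] = new1
        edges[edges.index((c, d))] = new2
    return edges

# Hypothetical ground-truth edge list over five genes.
true_net = [(0, 1), (0, 2), (1, 3), (2, 3), (3, 4), (4, 0), (1, 4), (2, 0)]
null_net = degree_preserving_shuffle(true_net, n_swaps=200)
```

Scoring a method against many such null networks yields an empirical baseline distribution; gains that survive relative to it are harder to attribute to degree or marginal-expression artifacts.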

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The comments highlight important gaps in validation that we will address through targeted revisions to strengthen the evidence for our claims.

read point-by-point responses
  1. Referee: [Methods (Virtual Value Perturbation and Gradient Trajectory subsections)] The central claim that Virtual Value Perturbation and Gradient Trajectory distill causal regulatory signals (rather than co-expression statistics, batch effects, or model-specific biases) is load-bearing for the abstract's assertion of a 'new paradigm.' No negative controls are described, such as edge permutation while preserving marginals or comparison against random-walk features on the same embedding space, leaving open the possibility that outperformance reflects better exploitation of pre-training correlations.

    Authors: We agree that explicit negative controls are necessary to support the claim that the proposed methods extract regulatory structure rather than pre-training artifacts or correlations. The current manuscript does not include such controls. In the revision we will add (i) edge-permutation baselines that preserve degree and marginal distributions and (ii) random-walk feature baselines computed directly on the frozen scFM embedding space. These controls will be reported alongside the main results to quantify how much of the observed generalization is attributable to our distillation procedures versus generic embedding properties. revision: yes

  2. Referee: [Experiments] The generalization benchmark is presented as inherently challenging for traditional methods, yet no ablations, statistical significance tests, numbers of independent runs, or ground-truth edge-validation details are provided to support the 'significantly outperforms' claim. This directly undermines assessment of whether the distilled features generalize on true regulatory structure.

    Authors: We acknowledge that the experimental reporting is currently insufficient to allow full assessment of statistical robustness and ground-truth fidelity. In the revised manuscript we will (a) include ablation studies removing each component of Virtual Value Perturbation and Gradient Trajectory, (b) report p-values from paired statistical tests across datasets, (c) state that all quantitative results are averaged over five independent runs with different random seeds, and (d) expand the description of ground-truth construction to detail the exact databases and filtering criteria used for edge validation. These additions will directly address concerns about whether performance gains reflect true regulatory generalization. revision: yes
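For item (b), one concrete choice of paired test is an exact sign-flip permutation test across datasets. The sketch below uses made-up AUPRC numbers purely for illustration; the abstract-only review supplies no actual metrics:

```python
import itertools

def paired_sign_flip_test(scores_a, scores_b):
    """Exact paired sign-flip permutation test: under the null that two
    methods perform equally well, each per-dataset score difference is
    equally likely to carry either sign. Enumerates all 2^n sign flips
    and returns a two-sided p-value."""
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    observed = abs(sum(diffs))
    hits = total = 0
    for signs in itertools.product((1, -1), repeat=len(diffs)):
        total += 1
        if abs(sum(s * d for s, d in zip(signs, diffs))) >= observed:
            hits += 1
    return hits / total

# Hypothetical per-dataset AUPRC values (invented for illustration),
# each imagined as an average over independent seeds.
ours     = [0.42, 0.37, 0.51, 0.45, 0.39, 0.48]
baseline = [0.35, 0.33, 0.44, 0.41, 0.36, 0.40]
p_value = paired_sign_flip_test(ours, baseline)  # 2/64 = 0.03125 here
```

The exact enumeration is feasible for the handful of datasets typical of GRN benchmarks; with many more datasets a sampled version or a Wilcoxon signed-rank test would be the usual substitute.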

Circularity Check

0 steps flagged

No significant circularity; derivation relies on proposed extraction methods and external benchmarks rather than self-referential definitions.

full rationale

The paper's core chain introduces a new zero-shot GRN generalization benchmark and two distillation procedures (Virtual Value Perturbation, Gradient Trajectory) whose outputs are evaluated by outperformance on held-out genes and datasets. No equations or steps reduce a claimed prediction to a fitted parameter by construction, nor does any load-bearing premise collapse to a self-citation whose validity is presupposed. The methods are presented as novel extraction steps whose success is measured against independent baselines and generalization splits, so the argument rests on external validation rather than self-referential definitions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no mathematical formulation, so no free parameters, axioms, or invented entities can be identified.

pith-pipeline@v0.9.0 · 5485 in / 1211 out tokens · 70122 ms · 2026-05-12T01:12:03.837429+00:00 · methodology

discussion (0)


Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
  • matches: the paper's claim is directly supported by a theorem in the formal canon.
  • supports: the theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
  • extends: the paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
  • uses: the paper appears to rely on the theorem as machinery.
  • contradicts: the paper's claim conflicts with a theorem or certificate in the canon.
  • unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages
