arXiv preprint arXiv:2401.04301 · 2024

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Learning Posterior Predictive Distributions for Node Classification from Synthetic Graph Priors

cs.LG · 2026-04-21 · unverdicted · novelty 7.0

NodePFN pre-trains on synthetic graphs with controllable homophily and causal feature-label models to achieve 71.27 average accuracy on 23 node classification benchmarks without graph-specific training.

Analogies between Transformer Layers and Power Method

cs.LG · 2026-05-25 · unverdicted · novelty 6.0

Transformer layers are analogous to power method steps, tilting tokens toward the principal eigenvector of the output-value weight product, with stronger analytical and empirical alignment in shared-weight models and a proposed steering method.

citing papers explorer

Showing 2 of 2 citing papers.

Learning Posterior Predictive Distributions for Node Classification from Synthetic Graph Priors

cs.LG · 2026-04-21 · unverdicted · none · ref 268

Analogies between Transformer Layers and Power Method

cs.LG · 2026-05-25 · unverdicted · none · ref 8