super hub Mixed citations

Jumper , author R

Alexander Pritzel, Alex Bridgland, Anna Potapenko, Augustin Žídek, Clemens Meyer, John Jumper + 2 more · 2021 · Nature · DOI 10.1038/s41586-021-03819-2

Mixed citation behavior. Most common role is background (64%).

30 Pith papers citing it

40.9k external citations · Crossref

Background 64% of classified citations

open at publisher browse 30 citing papers more from Alexander Pritzel

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 8 method 2 other 1

citation-polarity summary

background 7 use method 2 support 1 unclear 1

authors

Alexander Pritzel Alex Bridgland Anna Potapenko Augustin Žídek Clemens Meyer John Jumper Kathryn Tunyasuvunakool Michael Figurnov Olaf Ronneberger Richard Evans Russ Bates Tim Green

co-cited works

representative citing papers

ENSEMBITS: an alphabet of protein conformational ensembles

cs.LG · 2026-05-13 · unverdicted · novelty 8.0 · 2 refs

Ensembits is the first tokenizer of protein conformational ensembles that outperforms static tokenizers on RMSF prediction and matches them on function and mutation tasks while using less pretraining data.

Latent Process Generator Matching

cs.LG · 2026-05-19 · unverdicted · novelty 7.0

Presents a general framework for generator matching on projected image spaces from latent Markov processes, generalizing static latent results to dynamic conditional processes.

Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schr\"odinger Samplers

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Derives a conditional-marginal entropy-rate objective for bridge-aware discretization that yields U-shaped schedules and improves low-NFE sample quality on 2D, CIFAR-10, and protein tasks.

ProteinJEPA: Latent prediction complements protein language models

cs.LG · 2026-05-08 · unverdicted · novelty 7.0

Masked-position MLM plus JEPA latent prediction outperforms MLM-only pretraining on 10-11 of 16 downstream tasks for 35M-150M protein models while JEPA alone fails.

TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations

cs.LG · 2026-05-04 · unverdicted · novelty 7.0

TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.

Rates of forgetting for the sequentially Markov coalescent

math.PR · 2026-04-22 · unverdicted · novelty 7.0

SMC forgets its initial condition geometrically in the jump chain and as 1/ℓ in continuous genetic distance, justifying independent-locus approximations.

Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings

q-bio.QM · 2026-04-09 · unverdicted · novelty 7.0

Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.

Stochastic Thermodynamics of Associative Memory

cond-mat.stat-mech · 2026-01-03 · unverdicted · novelty 7.0

DenseAMs show tradeoffs between entropy production, retrieval accuracy, and speed at intermediate loads, with a new failure mode in higher-order networks at finite temperature.

Accelerating Inference for Multilayer Neural Networks with Quantum Computers

quant-ph · 2025-10-08 · unverdicted · novelty 7.0

Quantum circuits for coherent multilayer neural network inference achieve quadratic to polylogarithmic speedups over classical methods depending on quantum data access models for inputs and weights.

AlphaEvolve: A coding agent for scientific and algorithmic discovery

cs.AI · 2025-06-16 · unverdicted · novelty 7.0

AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.

Towards Understanding Self-Pretraining for Sequence Classification

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

Self-pretraining improves Transformer sequence classification by enabling learning of proximity-biased attention from positional encodings that label supervision alone cannot easily acquire from random starts.

CrystalBoltz: End-to-End Protein Structure Determination via Experiment-Guided Diffusion for X-Ray Crystallography

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

CrystalBoltz performs experiment-guided posterior sampling with diffusion models on structure-factor amplitudes for protein structure determination, reporting lower RMSD and R-factors than baselines with 33x faster runtime.

ShardTensor: Domain Parallelism for Scientific Machine Learning

cs.DC · 2026-05-11 · unverdicted · novelty 6.0

ShardTensor is a domain-parallelism system for SciML that enables flexible scaling of extreme-resolution spatial datasets by removing the constraint of batch size one per device.

Supercharging Bayesian Inference with Reliable AI-Informed Priors

stat.ML · 2026-05-11 · unverdicted · novelty 6.0

Rectified AI priors, obtained by correcting AI-induced data laws before embedding them in techniques like Dirichlet process priors, reduce bias, improve credible interval coverage, and boost performance in tasks like skin disease classification.

A physics-informed neural network approach to solve the spatially inhomogeneous electron Boltzmann equation

physics.plasm-ph · 2026-05-05 · unverdicted · novelty 6.0

A specialized PINN architecture solves the spatially inhomogeneous electron Boltzmann equation with high accuracy across gases and electric field strengths without case-specific tuning.

Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants

cs.LG · 2025-11-03 · unverdicted · novelty 6.0

Flashlight is a compiler-native PyTorch framework that generates efficient fused kernels for arbitrary and data-dependent attention variants, supporting more cases than FlexAttention with competitive performance.

Fast and Interpretable Protein Substructure Alignment via Optimal Transport

q-bio.QM · 2025-10-12 · unverdicted · novelty 6.0

PLASMA applies regularized optimal transport with Sinkhorn iterations to produce fast, interpretable residue-level alignments and similarity scores between protein structures.

Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization

cs.LG · 2024-10-25 · unverdicted · novelty 6.0

Diversity-regularized DPO fine-tuning of ProteinMPNN improves structural similarity scores by at least 8% over base model and sequence diversity by up to 20% over standard DPO for peptide inverse folding on OpenFold structures.

Enabling Structure-Only Initialization and Out-of-Distribution Generalization in GNN-based Molecular Dynamics Simulators

physics.chem-ph · 2026-05-10 · unverdicted · novelty 5.0

GNN-based MD simulators achieve stable structure-only initialization and reliable OOD generalization through inference-time physics optimization and a GNN barostat on elastic network compression tasks.

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery

eess.SY · 2026-05-06 · unverdicted · novelty 5.0 · 2 refs

The paper introduces Experiment-as-Code Labs as a declarative stack synthesizing AI agents, systems orchestration, and physical lab control for AI-driven discovery.

Benchmarking open-source tools for in silico antiviral drug discovery

q-bio.BM · 2026-05-05 · conditional · novelty 5.0

Boltz-2 and fine-tuned DrugFormDTA lead ML-based binding prediction while GNINA leads docking tools on a cleaned antiviral dataset, with performance varying by viral protein.

MIRA: A Score for Conditional Distribution Accuracy and Model Comparison

stat.ML · 2026-05-03 · unverdicted · novelty 5.0

MIRA is a new analytic score for conditional distribution accuracy derived from equal probability mass assignment, enabling Bayesian model comparison via direct posterior validation.

Sampling Parallelism for Fast and Efficient Bayesian Learning

cs.LG · 2026-04-06 · unverdicted · novelty 5.0

Sampling parallelism distributes Bayesian sample evaluations across GPUs for near-perfect scaling, lower memory use, and faster convergence via per-GPU data augmentations, outperforming pure data parallelism in diversity.

Galactica: A Large Language Model for Science

cs.CL · 2022-11-16 · unverdicted · novelty 5.0

Galactica, a science-specialized LLM, reports higher scores than GPT-3, Chinchilla, and PaLM on LaTeX knowledge, mathematical reasoning, and medical QA benchmarks while outperforming general models on BIG-bench.

citing papers explorer

Showing 30 of 30 citing papers.

ENSEMBITS: an alphabet of protein conformational ensembles cs.LG · 2026-05-13 · unverdicted · none · ref 7 · 2 links
Ensembits is the first tokenizer of protein conformational ensembles that outperforms static tokenizers on RMSF prediction and matches them on function and mutation tasks while using less pretraining data.
Latent Process Generator Matching cs.LG · 2026-05-19 · unverdicted · none · ref 18
Presents a general framework for generator matching on projected image spaces from latent Markov processes, generalizing static latent results to dynamic conditional processes.
Entropy Across the Bridge: Conditional-Marginal Discretization for Flow and Schr\"odinger Samplers cs.LG · 2026-05-15 · unverdicted · none · ref 19
Derives a conditional-marginal entropy-rate objective for bridge-aware discretization that yields U-shaped schedules and improves low-NFE sample quality on 2D, CIFAR-10, and protein tasks.
ProteinJEPA: Latent prediction complements protein language models cs.LG · 2026-05-08 · unverdicted · none · ref 9
Masked-position MLM plus JEPA latent prediction outperforms MLM-only pretraining on 10-11 of 16 downstream tasks for 35M-150M protein models while JEPA alone fails.
TCD-Arena: Assessing Robustness of Time Series Causal Discovery Methods Against Assumption Violations cs.LG · 2026-05-04 · unverdicted · none · ref 150
TCD-Arena is a new customizable testing framework that runs millions of experiments to map how 33 different assumption violations affect time series causal discovery methods and shows ensembles can boost overall robustness.
Rates of forgetting for the sequentially Markov coalescent math.PR · 2026-04-22 · unverdicted · none · ref 84
SMC forgets its initial condition geometrically in the jump chain and as 1/ℓ in continuous genetic distance, justifying independent-locus approximations.
Dual Triangle Attention: Effective Bidirectional Attention Without Positional Embeddings q-bio.QM · 2026-04-09 · unverdicted · none · ref 4
Dual Triangle Attention achieves effective bidirectional attention with built-in positional inductive bias via dual triangular masks, outperforming standard bidirectional attention on position-sensitive tasks and showing strong masked language modeling results with or without positional embeddings.
Stochastic Thermodynamics of Associative Memory cond-mat.stat-mech · 2026-01-03 · unverdicted · none · ref 32
DenseAMs show tradeoffs between entropy production, retrieval accuracy, and speed at intermediate loads, with a new failure mode in higher-order networks at finite temperature.
Accelerating Inference for Multilayer Neural Networks with Quantum Computers quant-ph · 2025-10-08 · unverdicted · none · ref 12
Quantum circuits for coherent multilayer neural network inference achieve quadratic to polylogarithmic speedups over classical methods depending on quantum data access models for inputs and weights.
AlphaEvolve: A coding agent for scientific and algorithmic discovery cs.AI · 2025-06-16 · unverdicted · none · ref 47
AlphaEvolve is an LLM-orchestrated evolutionary coding agent that discovered a 4x4 complex matrix multiplication algorithm using 48 scalar multiplications, the first improvement over Strassen's algorithm in 56 years, plus optimizations for Google data centers and hardware.
Towards Understanding Self-Pretraining for Sequence Classification cs.LG · 2026-05-20 · unverdicted · none · ref 15
Self-pretraining improves Transformer sequence classification by enabling learning of proximity-biased attention from positional encodings that label supervision alone cannot easily acquire from random starts.
CrystalBoltz: End-to-End Protein Structure Determination via Experiment-Guided Diffusion for X-Ray Crystallography cs.LG · 2026-05-15 · unverdicted · none · ref 15
CrystalBoltz performs experiment-guided posterior sampling with diffusion models on structure-factor amplitudes for protein structure determination, reporting lower RMSD and R-factors than baselines with 33x faster runtime.
ShardTensor: Domain Parallelism for Scientific Machine Learning cs.DC · 2026-05-11 · unverdicted · none · ref 15
ShardTensor is a domain-parallelism system for SciML that enables flexible scaling of extreme-resolution spatial datasets by removing the constraint of batch size one per device.
Supercharging Bayesian Inference with Reliable AI-Informed Priors stat.ML · 2026-05-11 · unverdicted · none · ref 10
Rectified AI priors, obtained by correcting AI-induced data laws before embedding them in techniques like Dirichlet process priors, reduce bias, improve credible interval coverage, and boost performance in tasks like skin disease classification.
A physics-informed neural network approach to solve the spatially inhomogeneous electron Boltzmann equation physics.plasm-ph · 2026-05-05 · unverdicted · none · ref 16
A specialized PINN architecture solves the spatially inhomogeneous electron Boltzmann equation with high accuracy across gases and electric field strengths without case-specific tuning.
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants cs.LG · 2025-11-03 · unverdicted · none · ref 12
Flashlight is a compiler-native PyTorch framework that generates efficient fused kernels for arbitrary and data-dependent attention variants, supporting more cases than FlexAttention with competitive performance.
Fast and Interpretable Protein Substructure Alignment via Optimal Transport q-bio.QM · 2025-10-12 · unverdicted · none · ref 13
PLASMA applies regularized optimal transport with Sinkhorn iterations to produce fast, interpretable residue-level alignments and similarity scores between protein structures.
Improving Inverse Folding for Peptide Design with Diversity-regularized Direct Preference Optimization cs.LG · 2024-10-25 · unverdicted · none · ref 19
Diversity-regularized DPO fine-tuning of ProteinMPNN improves structural similarity scores by at least 8% over base model and sequence diversity by up to 20% over standard DPO for peptide inverse folding on OpenFold structures.
Enabling Structure-Only Initialization and Out-of-Distribution Generalization in GNN-based Molecular Dynamics Simulators physics.chem-ph · 2026-05-10 · unverdicted · none · ref 145
GNN-based MD simulators achieve stable structure-only initialization and reliable OOD generalization through inference-time physics optimization and a GNN barostat on elastic network compression tasks.
Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery eess.SY · 2026-05-06 · unverdicted · none · ref 60 · 2 links
The paper introduces Experiment-as-Code Labs as a declarative stack synthesizing AI agents, systems orchestration, and physical lab control for AI-driven discovery.
Benchmarking open-source tools for in silico antiviral drug discovery q-bio.BM · 2026-05-05 · conditional · none · ref 155
Boltz-2 and fine-tuned DrugFormDTA lead ML-based binding prediction while GNINA leads docking tools on a cleaned antiviral dataset, with performance varying by viral protein.
MIRA: A Score for Conditional Distribution Accuracy and Model Comparison stat.ML · 2026-05-03 · unverdicted · none · ref 141
MIRA is a new analytic score for conditional distribution accuracy derived from equal probability mass assignment, enabling Bayesian model comparison via direct posterior validation.
Sampling Parallelism for Fast and Efficient Bayesian Learning cs.LG · 2026-04-06 · unverdicted · none · ref 20
Sampling parallelism distributes Bayesian sample evaluations across GPUs for near-perfect scaling, lower memory use, and faster convergence via per-GPU data augmentations, outperforming pure data parallelism in diversity.
Galactica: A Large Language Model for Science cs.CL · 2022-11-16 · unverdicted · none · ref 186
Galactica, a science-specialized LLM, reports higher scores than GPT-3, Chinchilla, and PaLM on LaTeX knowledge, mathematical reasoning, and medical QA benchmarks while outperforming general models on BIG-bench.
AIMBio-Mat: An AI-Native FAIR Platform for Closed-Loop Materials Discovery and Biomedical Translation physics.app-ph · 2026-05-20 · unverdicted · none · ref 15
AIMBio-Mat is a conceptual blueprint for an AI-native, FAIR, governance-aware decision layer that formulates biomedical-materials discovery as constrained multi-objective optimization under uncertainty.
The Research Guide: From Informal Role to Profession physics.ed-ph · 2026-04-21 · unverdicted · none · ref 43
The authors argue that guiding non-PhD learners through authentic research requires a dedicated profession with its own training, career structure, and recognition because existing models and programs fall short.
Towards a Universal Foundation Model for Protein Dynamics: A Multi-Chain Tree-Structured Framework with Transformer Propagators physics.atom-ph · 2025-02-09 · unverdicted · none · ref 31
Proposes TSCG hierarchical representation and Transformer propagator for universal coarse-grained protein MD with claimed 10k-20k times acceleration over all-atom MD while preserving statistical properties.
On the Diffusion Time Evolution of Folding Chains in the Heteropolymer Model math.DS · 2022-08-25 · unverdicted · none · ref 9
Folding chains in the heteropolymer model diffuse according to D ~ t^ν with ν decreasing from 0.666 to 0.5 as coupling randomness increases.
NOVA: Fundamental Limits of Knowledge Discovery Through AI cs.AI · 2026-05-12 · unreviewed · ref 6
From Mechanistic to Compositional Interpretability cs.LG · 2026-05-09 · unreviewed · ref 84

Jumper , author R

hub tools

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer