hub Mixed citations

Pytorch: An imperative style, high-performance deep learning library.Advances in neural information processing systems, 32

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al · 2019

Mixed citation behavior. Most common role is background (45%).

31 Pith papers citing it

Background 45% of classified citations

browse 31 citing papers

hub tools

JSON dossier citing papers JSON

citation-role summary

background 6 method 4 other 1

citation-polarity summary

background 5 use method 4 unclear 2

representative citing papers

Canonical Regularisation of Wide Feature-Learning Neural Networks

stat.ML · 2026-05-18 · unverdicted · novelty 8.0

Derives geodesic ridge regularization and Riemannian Gibbs Process prior for feature-learning wide neural networks, generalizing kernel-regime results via function-space axiomatization.

Learning Through Noise: Why Subliminal Learning Works and When It Fails

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

Subliminal learning occurs via compatible auxiliary and class output heads on task-unrelated inputs, even with random hidden layers or architecture changes, with theory and upper bounds on failure.

Classical State Preparation for Variational Quantum Algorithms via Reinforcement Learning

quant-ph · 2026-05-22 · unverdicted · novelty 7.0

CRiSP uses neural-guided MCTS and curriculum learning to insert Clifford prefixes before parameterized rotations in VQAs, yielding mean 3.17x and max 45x gains in energy accuracy on 22-qubit QAOA benchmarks versus prior Clifford initializers.

Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning

cs.CV · 2026-05-13 · unverdicted · novelty 7.0

SPA unlocks patch-level features in CLIP for class-incremental learning via semantic-guided selection and optimal transport alignment with class descriptions, plus projectors and pseudo-feature replay to reduce forgetting.

Spectral Energy Centroid: a Metric for Improving Performance and Analyzing Spectral Bias in Implicit Neural Representations

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Spectral Energy Centroid is a new metric that quantifies signal frequency and INR spectral bias, supporting better hyperparameter selection and cross-architecture analysis.

QAP-Router: Tackling Qubit Routing as Dynamic Quadratic Assignment with Reinforcement Learning

quant-ph · 2026-05-12 · unverdicted · novelty 7.0

QAP-Router models qubit routing as dynamic QAP and applies RL with a solution-aware Transformer to cut CNOT counts by 12-30% versus industry compilers on real circuit benchmarks.

LookWhen? Fast Video Recognition by Learning When, Where, and What to Compute

cs.CV · 2026-05-07 · conditional · novelty 7.0

LookWhen factorizes video recognition into learning when, where, and what to compute via uniqueness-based token selection and dual-teacher distillation, achieving better accuracy-FLOPs trade-offs than baselines on multiple datasets.

Parametrizing Convex Sets Using Sublinear Neural Networks

math.OC · 2026-05-05 · unverdicted · novelty 7.0

Sublinear neural networks parametrize convex sets by learning their support and gauge functions, backed by a universal approximation theorem and tested on shape optimization tasks.

Scalable Memristive-Friendly Reservoir Computing for Time Series Classification

cs.NE · 2026-04-21 · unverdicted · novelty 7.0

MARS parallel reservoirs achieve up to 21x training speedups and outperform LRU, S5, and Mamba on long sequence benchmarks while remaining gradient-free and compact.

Multiscale topology optimization of compressible and nearly incompressible anisotropic hyperelastic structures using physics-augmented neural networks

cs.CE · 2026-04-07 · unverdicted · novelty 7.0

Physics-augmented neural networks act as stable, thermodynamically consistent surrogates for microscale problems, enabling simultaneous optimization of macroscale material layout and microscale descriptors in nonlinear finite-strain anisotropic hyperelastic structures.

MemDLM: Memory-Enhanced DLM Training

cs.CL · 2026-03-23 · unverdicted · novelty 7.0

MemDLM embeds a simulated denoising trajectory into DLM training via bi-level optimization, creating a parametric memory that improves convergence and long-context performance even when the memory is dropped at test time.

FBApro: A fast, simple linear transformation for diverse metabolic modeling tasks

q-bio.QM · 2026-01-21 · unverdicted · novelty 7.0

FBApro computes the nearest steady-state flux distribution to a reference vector via a closed-form linear projection derived from orthogonal projections onto affine spaces.

Accelerating Video Inverse Problem Solvers with Autoregressive Diffusion Models

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

AVIS applies autoregressive diffusion models to video inverse problems by streaming restoration with measurement-consistent initialization, reducing latency from 114s to 4s and raising throughput to 1.18 FPS (or 5.91 FPS in the Flash variant).

GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction

cs.CV · 2026-05-17 · unverdicted · novelty 6.0

GeoHand adapts priors from a general-scene geometry estimator via a GeoAdapter, gated fusion, and keypoint-queried refiner to reach SOTA monocular 3D hand reconstruction on FreiHAND, DexYCB, and HO3Dv3 under heavy occlusion.

Learning plug-in surrogate endpoints for randomized experiments

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Two methods are introduced to learn plug-in composite surrogates that maximize effect predictiveness, with the direct surrogate-effect modeling approach outperforming baselines on synthetic data with known effects and real-world experiment data.

Enabling Performant and Flexible Model-Internal Observability for LLM Inference

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

DMI-Lib delivers 0.4-6.8% overhead for offline batch LLM inference and ~6% for moderate online serving while exposing rich internal signals across backends, cutting latency overhead 2-15x versus prior observability baselines.

DuetFair: Coupling Inter- and Intra-Subgroup Robustness for Fair Medical Image Segmentation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

DuetFair couples inter-subgroup adaptation with intra-subgroup robustness via FairDRO (dMoE plus subgroup-conditioned DRO) to boost worst-case and equity-scaled performance on medical segmentation benchmarks.

Inference-Time Attribute Distribution Alignment for Unconditional Diffusion

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

An optimal control formulation adds time-dependent perturbations to the reverse diffusion process to match target attribute distributions while preserving sample fidelity.

FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control

cs.LG · 2026-04-06 · unverdicted · novelty 6.0 · 2 refs

FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.

Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping

cs.LG · 2026-01-24 · unverdicted · novelty 6.0

ARS shapes reasoning trace representations by clustering states that produce consistent answers and separating those that produce inconsistent ones via latent perturbations, improving plug-and-play hallucination detection without human annotations.

stable-worldmodel: A Platform for Reproducible World Modeling Research and Evaluation

cs.LG · 2026-05-20 · unverdicted · novelty 5.0

The paper presents stable-worldmodel (swm), a platform with high-performance data layer, modern world model baselines, planning solvers, and extended environments for reproducible research and generalization evaluation.

MahaVar: OOD Detection via Class-wise Mahalanobis Distance Variance under Neural Collapse

cs.LG · 2026-05-14 · conditional · novelty 5.0

MahaVar augments the Mahalanobis OOD score with class-wise distance variance, which is theoretically higher for in-distribution samples under relaxed Neural Collapse geometry.

Generating Symmetric Materials using Latent Flow Matching

cs.LG · 2026-05-11 · unverdicted · novelty 5.0

SymADiT generates stable symmetric materials by enforcing Wyckoff-position and space-group constraints inside a latent generative model built on the prior ADiT architecture.

Learning Unified Representations of Normalcy for Time Series Anomaly Detection

cs.LG · 2026-05-10 · unverdicted · novelty 5.0

U²AD learns unified normal data representations via score-based generative modeling and a novel time-dependent score network to outperform prior methods in accuracy and early anomaly detection for multivariate time series.

citing papers explorer

Showing 31 of 31 citing papers.

Canonical Regularisation of Wide Feature-Learning Neural Networks stat.ML · 2026-05-18 · unverdicted · none · ref 35
Derives geodesic ridge regularization and Riemannian Gibbs Process prior for feature-learning wide neural networks, generalizing kernel-regime results via function-space axiomatization.
Learning Through Noise: Why Subliminal Learning Works and When It Fails cs.LG · 2026-05-22 · unverdicted · none · ref 19
Subliminal learning occurs via compatible auxiliary and class output heads on task-unrelated inputs, even with random hidden layers or architecture changes, with theory and upper bounds on failure.
Classical State Preparation for Variational Quantum Algorithms via Reinforcement Learning quant-ph · 2026-05-22 · unverdicted · none · ref 74
CRiSP uses neural-guided MCTS and curriculum learning to insert Clifford prefixes before parameterized rotations in VQAs, yielding mean 3.17x and max 45x gains in energy accuracy on 22-qubit QAOA benchmarks versus prior Clifford initializers.
Unlocking Patch-Level Features for CLIP-Based Class-Incremental Learning cs.CV · 2026-05-13 · unverdicted · none · ref 37
SPA unlocks patch-level features in CLIP for class-incremental learning via semantic-guided selection and optimal transport alignment with class descriptions, plus projectors and pseudo-feature replay to reduce forgetting.
Spectral Energy Centroid: a Metric for Improving Performance and Analyzing Spectral Bias in Implicit Neural Representations cs.LG · 2026-05-12 · unverdicted · none · ref 12
Spectral Energy Centroid is a new metric that quantifies signal frequency and INR spectral bias, supporting better hyperparameter selection and cross-architecture analysis.
QAP-Router: Tackling Qubit Routing as Dynamic Quadratic Assignment with Reinforcement Learning quant-ph · 2026-05-12 · unverdicted · none · ref 50
QAP-Router models qubit routing as dynamic QAP and applies RL with a solution-aware Transformer to cut CNOT counts by 12-30% versus industry compilers on real circuit benchmarks.
LookWhen? Fast Video Recognition by Learning When, Where, and What to Compute cs.CV · 2026-05-07 · conditional · none · ref 34
LookWhen factorizes video recognition into learning when, where, and what to compute via uniqueness-based token selection and dual-teacher distillation, achieving better accuracy-FLOPs trade-offs than baselines on multiple datasets.
Parametrizing Convex Sets Using Sublinear Neural Networks math.OC · 2026-05-05 · unverdicted · none · ref 25
Sublinear neural networks parametrize convex sets by learning their support and gauge functions, backed by a universal approximation theorem and tested on shape optimization tasks.
Scalable Memristive-Friendly Reservoir Computing for Time Series Classification cs.NE · 2026-04-21 · unverdicted · none · ref 32
MARS parallel reservoirs achieve up to 21x training speedups and outperform LRU, S5, and Mamba on long sequence benchmarks while remaining gradient-free and compact.
Multiscale topology optimization of compressible and nearly incompressible anisotropic hyperelastic structures using physics-augmented neural networks cs.CE · 2026-04-07 · unverdicted · none · ref 76
Physics-augmented neural networks act as stable, thermodynamically consistent surrogates for microscale problems, enabling simultaneous optimization of macroscale material layout and microscale descriptors in nonlinear finite-strain anisotropic hyperelastic structures.
MemDLM: Memory-Enhanced DLM Training cs.CL · 2026-03-23 · unverdicted · none · ref 26
MemDLM embeds a simulated denoising trajectory into DLM training via bi-level optimization, creating a parametric memory that improves convergence and long-context performance even when the memory is dropped at test time.
FBApro: A fast, simple linear transformation for diverse metabolic modeling tasks q-bio.QM · 2026-01-21 · unverdicted · none · ref 36
FBApro computes the nearest steady-state flux distribution to a reference vector via a closed-form linear projection derived from orthogonal projections onto affine spaces.
Accelerating Video Inverse Problem Solvers with Autoregressive Diffusion Models cs.CV · 2026-05-20 · unverdicted · none · ref 61
AVIS applies autoregressive diffusion models to video inverse problems by streaming restoration with measurement-consistent initialization, reducing latency from 114s to 4s and raising throughput to 1.18 FPS (or 5.91 FPS in the Flash variant).
GeoHand: Unlocking Prior Geometry Knowledge for Monocular 3D Hand Reconstruction cs.CV · 2026-05-17 · unverdicted · none · ref 67
GeoHand adapts priors from a general-scene geometry estimator via a GeoAdapter, gated fusion, and keypoint-queried refiner to reach SOTA monocular 3D hand reconstruction on FreiHAND, DexYCB, and HO3Dv3 under heavy occlusion.
Learning plug-in surrogate endpoints for randomized experiments cs.LG · 2026-05-12 · unverdicted · none · ref 29
Two methods are introduced to learn plug-in composite surrogates that maximize effect predictiveness, with the direct surrogate-effect modeling approach outperforming baselines on synthetic data with known effects and real-world experiment data.
Enabling Performant and Flexible Model-Internal Observability for LLM Inference cs.LG · 2026-05-11 · unverdicted · none · ref 31
DMI-Lib delivers 0.4-6.8% overhead for offline batch LLM inference and ~6% for moderate online serving while exposing rich internal signals across backends, cutting latency overhead 2-15x versus prior observability baselines.
DuetFair: Coupling Inter- and Intra-Subgroup Robustness for Fair Medical Image Segmentation cs.CV · 2026-05-11 · unverdicted · none · ref 25
DuetFair couples inter-subgroup adaptation with intra-subgroup robustness via FairDRO (dMoE plus subgroup-conditioned DRO) to boost worst-case and equity-scaled performance on medical segmentation benchmarks.
Inference-Time Attribute Distribution Alignment for Unconditional Diffusion cs.LG · 2026-05-08 · unverdicted · none · ref 81
An optimal control formulation adds time-dependent perturbations to the reverse diffusion process to match target attribute distributions while preserving sample fidelity.
FlashSAC: Fast and Stable Off-Policy Reinforcement Learning for High-Dimensional Robot Control cs.LG · 2026-04-06 · unverdicted · none · ref 65 · 2 links
FlashSAC improves training speed and final performance of off-policy RL on high-dimensional robot tasks by reducing update frequency, increasing model scale, and bounding norms to limit critic error accumulation.
Harnessing Reasoning Trajectories for Hallucination Detection via Answer-agreement Representation Shaping cs.LG · 2026-01-24 · unverdicted · none · ref 28
ARS shapes reasoning trace representations by clustering states that produce consistent answers and separating those that produce inconsistent ones via latent perturbations, improving plug-and-play hallucination detection without human annotations.
stable-worldmodel: A Platform for Reproducible World Modeling Research and Evaluation cs.LG · 2026-05-20 · unverdicted · none · ref 15
The paper presents stable-worldmodel (swm), a platform with high-performance data layer, modern world model baselines, planning solvers, and extended environments for reproducible research and generalization evaluation.
MahaVar: OOD Detection via Class-wise Mahalanobis Distance Variance under Neural Collapse cs.LG · 2026-05-14 · conditional · none · ref 26
MahaVar augments the Mahalanobis OOD score with class-wise distance variance, which is theoretically higher for in-distribution samples under relaxed Neural Collapse geometry.
Generating Symmetric Materials using Latent Flow Matching cs.LG · 2026-05-11 · unverdicted · none · ref 40
SymADiT generates stable symmetric materials by enforcing Wyckoff-position and space-group constraints inside a latent generative model built on the prior ADiT architecture.
Learning Unified Representations of Normalcy for Time Series Anomaly Detection cs.LG · 2026-05-10 · unverdicted · none · ref 54
U²AD learns unified normal data representations via score-based generative modeling and a novel time-dependent score network to outperform prior methods in accuracy and early anomaly detection for multivariate time series.
Tabular Foundation Model for Generative Modelling cs.LG · 2026-05-10 · unverdicted · none · ref 64
TabFORGE generates high-quality synthetic tabular data by leveraging pretrained causality-aware representations in a two-stage diffusion-decoder architecture that mitigates latent distribution shifts.
Self-Play Enhancement via Advantage-Weighted Refinement in Online Federated LLM Fine-Tuning with Real-Time Feedback cs.LG · 2026-05-08 · unverdicted · none · ref 28
SPEAR enables online federated LLM fine-tuning by using feedback-guided self-play to create contrastive pairs trained with maximum likelihood on correct completions and confidence-weighted unlikelihood on incorrect ones, outperforming baselines without ground-truth contexts.
BGM-IV: an AI-powered Bayesian generative modeling approach for instrumental variable analysis stat.ML · 2026-05-07 · unverdicted · none · ref 38
BGM-IV performs nonlinear IV regression by inferring causally structured latent components and replacing the outcome likelihood with an instrument-averaged pseudo-likelihood, showing strongest results in high-dimensional covariate regimes.
Cubit: Token Mixer with Kernel Ridge Regression cs.LG · 2026-05-07 · unverdicted · none · ref 56 · 2 links
Cubit replaces Transformer's attention with a closed-form Kernel Ridge Regression token mixer and reports larger gains as training sequence length increases.
Drivetrain simulation using variational autoencoders cs.LG · 2025-01-29 · unverdicted · none · ref 42
Variational autoencoders generate jerk signals from torque inputs in electric drivetrains and outperform physics-based baselines without detailed parametrization.
Triple Configuration of Brain Networks Based on Recurrent Neural Networks: The Synergistic Effects of Exogenous Stimuli, Task Demands, and Spontaneous Activity q-bio.NC · 2026-04-26 · unverdicted · none · ref 34
RNNs with dynamic constraints applied to EEG data separate brain network activity into three configurations driven by stimuli, tasks, and spontaneous processes, highlighting the parietal network as a central hub.
Benchmarking PyCaret AutoML Against BiLSTM for Fine-Grained Emotion Classification: A Comparative Study on 20-Class Emotion Detection cs.CL · 2026-04-29 · unverdicted · none · ref 8
BiLSTM achieves 89% accuracy and 0.89 weighted F1 on 20-class emotion detection, marginally outperforming SVM at 88.11% on a 79,595-sentence dataset.

Pytorch: An imperative style, high-performance deep learning library.Advances in neural information processing systems, 32

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer