super hub Canonical reference

Deep learning

Geoffrey Hinton, Yann LeCun, Yoshua Bengio · 2015 · Nature · DOI 10.1038/nature14539 · arXiv gov/2601744

Canonical reference. 100% of citing Pith papers cite this work as background.

81 Pith papers citing it

71.9k external citations · Crossref

Background 100% of classified citations

open at publisher browse 81 citing papers more from Geoffrey Hinton arXiv PDF

hub tools

JSON dossier citing papers JSON publisher DOI arXiv source

citation-role summary

background 14 method 1

citation-polarity summary

background 15

authors

Geoffrey Hinton Yann LeCun Yoshua Bengio

co-cited works

representative citing papers

Embodied Explainability and Ontological Obstacles: Why We Struggle to Explain the Answers of Large Language Models (LLMs)

cs.HC · 2026-06-22 · unverdicted · novelty 7.0

An argument paper reframes LLM explainability as an embodied, situated practice based on Dourish and enactivist cognition, identifying ontological obstacles in internal explanations and advocating affordance-based designs.

ffortissimo: A Freeform Forward-Modeling Pipeline for High-Contrast Images of Circumstellar Disks Based on Automatic Differentiation

astro-ph.IM · 2026-06-22 · unverdicted · novelty 7.0

ffortissimo is a JAX-based freeform forward-modeling pipeline that fits complex dust distributions and infers scattering properties in KLIP-reduced images of circumstellar disks such as HR 4796A.

eCNNTO: A Highly Generalizable ConvNet for Accelerating Topology Optimization

cs.AI · 2026-06-18 · unverdicted · novelty 7.0

eCNNTO applies an element-wise CNN with residual connections and final-stage training data to accelerate density-based topology optimization while generalizing across boundary conditions, loads, geometries, and mesh sizes.

Optimal scenario design for climate emulation

physics.ao-ph · 2026-06-17 · unverdicted · novelty 7.0

Optimizing training data via a differentiable SCM yields climate emulators that outperform those trained on six standard ScenarioMIP pathways while using less data and isolating distinct forcing responses.

Multi-channel Optical Vision Model

physics.optics · 2026-06-08 · unverdicted · novelty 7.0

Spatial multiplexing in optical neural networks is repurposed as a trainable representational coordinate, demonstrated in multi-layer architectures for image classification, regression, and hybrid vision-language captioning with over one million optical phase parameters.

Inverse Critical Experiment Design via Gradient Optimization and a Multigroup Attention-Based Neural Network Architecture

cs.LG · 2026-06-01 · unverdicted · novelty 7.0

A U-Net surrogate with multigroup attention pooling is trained on OpenMC sensitivity data and combined with gradient optimization to generate grid-based critical experiment geometries that achieve c_k values up to 0.97757 for HALEU fuel validation.

Cumulative Meta-Learning from Active Learning Queries for Robustness to Spurious Correlations

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

CAML meta-learns a progressively refined inductive bias from active-learning queries to improve robustness to spurious correlations, reporting accuracy gains on minority groups across several benchmarks.

Toy Combinatorial Interpretability Models Reveal Lottery Tickets in Early Feature Space

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

In a combinatorial toy setting, winning lottery tickets preserve families of compatible feature locations in early feature space that balance proximity to final codes with low interference, rather than specific weight subnetworks.

DualTCN: A Physics-Constrained Temporal Convolutional Network for 2 Time-Domain Marine CSEM Inversion

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

DualTCN is the first deep-learning model for time-domain marine CSEM inversion that regresses four earth parameters, achieves high accuracy on simulated data, and runs up to 21,000 times faster than classical optimizers.

Broximal Alignment for Global Non-Convex Optimization

math.OC · 2026-04-15 · unverdicted · novelty 7.0

Broximal Alignment is a novel condition under which the Ball Proximal Point Method converges to global minima in non-convex settings, generalizing quasiconvexity, star convexity, and related frameworks.

On the Decompositionality of Neural Networks

cs.LO · 2026-04-09 · unverdicted · novelty 7.0

Neural decompositionality is defined via decision-boundary semantic preservation, and language transformers largely satisfy it under SAVED while vision models often do not.

Accelerating Inference for Multilayer Neural Networks with Quantum Computers

quant-ph · 2025-10-08 · unverdicted · novelty 7.0

Quantum circuits for coherent multilayer neural network inference achieve quadratic to polylogarithmic speedups over classical methods depending on quantum data access models for inputs and weights.

Non-markovian neural quantum propagator and its application to the simulation of ultrafast nonlinear spectra

physics.chem-ph · 2024-08-01 · unverdicted · novelty 7.0

A machine learning model called neural quantum propagator is introduced to efficiently solve non-Markovian quantum dynamics described by HEOM and applied to simulate spectra of the FMO complex.

Identifying structural design principles shaping the computational abilities of recurrent neural networks

q-bio.NC · 2026-06-22 · unverdicted · novelty 6.0

Local 2- and 3-cycles enhance RNN computational capacity for Boolean functions, predicted by structural statistics, while adding interneurons boosts large networks.

Unmasking LAION-5B: Age, Gender, Race, and Emotion Biases in Large-Scale Image Datasets

cs.CV · 2026-06-22 · unverdicted · novelty 6.0

Empirical audit of LAION-2B-en and LAION-2B-multi finds overrepresentation of young adults, White people, and males plus stereotypical emotion associations across two attribute classifiers.

Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems

cs.LG · 2026-06-18 · unverdicted · novelty 6.0

Hybrid neural-process model derives biokinetic parameters from genomic traits for soil organic matter turnover, with ecological constraints, and outperforms baselines on synthetic and real data.

A Geometric Measure of Linear Separability for Neural Representations

cs.LG · 2026-06-07 · unverdicted · novelty 6.0

Introduces the directional linear separability measure (LSM) as an asymmetric diagnostic for one-sided affine separability of neural representations.

Towards Unified and Data-Efficient Prognostics and Health Management with Tabular Foundation Models

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

Tabular foundation models applied to PHM via signal-to-table conversion achieve the best average ranks across prognostic and diagnostic tasks and remain competitive in low-data regimes.

Parameter-efficient Dual-encoder Architecture with Differentiable Choquet Integral Fusion for Underwater Acoustic Classification

cs.SD · 2026-06-01 · unverdicted · novelty 6.0

A parameter-efficient dual-encoder model with differentiable Choquet integral fusion improves underwater acoustic classification accuracy over single-encoder baselines on DeepShip and ShipsEar datasets.

Ultrafast formation of a large dynamic magnetic soliton

cond-mat.mes-hall · 2026-05-30 · unverdicted · novelty 6.0

Observation of ultrafast large dynamic magnetic soliton formation inside the linear spin-wave band in garnet films, extending tens of microns and collapsing into short-wavelength spin waves at large distances.

Composing Non-Conjugate Factor Graphs with Closed-Form Variational Inference

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

Models composed from bilinear factor, exponential link, Gamma prior, Gaussian likelihood, and equality node admit closed-form variational message passing under mean-field factorization.

Picid: A Modular Evaluation Infrastructure for Reproducible PHM Across Tasks and Domains

cs.AI · 2026-05-27 · unverdicted · novelty 6.0

Picid is a new modular evaluation infrastructure that enforces deterministic, leakage-safe dataset construction and unified protocols for fault detection, diagnostics, and prognostics across twelve datasets and thirteen models.

On the Equivariant Learning of the $Q$-tensor Order Parameter

cond-mat.soft · 2026-05-26 · unverdicted · novelty 6.0

Equivariant neural networks for 2D Q-tensor prediction in nematic liquid crystals achieve lower errors and better generalization than non-equivariant models while satisfying symmetry constraints.

Symmetrization of Loss Functions for Robust Training of Neural Networks in the Presence of Noisy Labels

cs.LG · 2026-05-19 · unverdicted · novelty 6.0 · 2 refs

Symmetrizing cross-entropy produces the unique convex multi-class unhinged loss, which locally approximates other symmetric losses, and enables new interpolating losses SGCE and alpha-MAE with competitive performance on noisy-label benchmarks.

citing papers explorer

Showing 31 of 81 citing papers.

Determination of Nanoparticle and Microdroplet Parameters in Levitating Microdroplets of Suspension by Speckle Image Analysis Using Convolutional Neural Networks physics.app-ph · 2026-04-08 · unverdicted · none · ref 11
CNNs trained on speckle images from levitating TiO2 suspension microdroplets classify droplet diameter with better than 6% accuracy and provide useful discrimination for nanoparticle concentration and diameter, including simultaneous three-parameter classification.
Operator-Theoretic Energy Functionals for Impulse-Excited Nonstationary Signal Analysis eess.SP · 2026-04-07 · unverdicted · none · ref 26
An operator-based Energy Concentration Index yields the IMRED detector that identifies defect-induced changes in impulse responses with AUC 0.908, outperforming standard Fourier and wavelet energy measures.
Vanishing Contributions: A Unified Framework for Smooth and Iterative Model Compression cs.LG · 2025-10-09 · unverdicted · none · ref 1
VCON is a unified framework for smooth iterative DNN compression that uses parallel execution and an affine combination to progressively replace the original model with its compressed form during fine-tuning.
Bayesian Reasoning for Physics Informed Neural Networks physics.comp-ph · 2023-08-25 · unverdicted · none · ref 1
Introduces Laplace-approximated Bayesian PINNs for automatic loss-weight optimization when solving PDEs such as heat, wave, and Burgers equations.
General Inverse Design of Thin-Film Metamaterials With Convolutional Neural Networks physics.comp-ph · 2021-03-29 · unverdicted · none · ref 40
Convolutional neural networks are shown to perform inverse design of thin-film metamaterial stacks by learning the mapping from structure to ellipsometric and reflectance/transmittance spectra, with efficiency gains over traditional optimization as layer count increases.
MAPE: Defending Against Transferable Adversarial Attacks Using Multi-Source Adversarial Perturbations Elimination cs.CV · 2026-06-30 · unverdicted · none · ref 1
MAPE combines a channel-attention U-Net (SAPE) trained on multi-model adversarial examples scheduled by PPSA to eliminate perturbations, reporting over 95.1% average defense on CIFAR-10 and 71.5% on Mini-ImageNet against black-box transferable attacks.
From Sentiment to Actionable Insights: A Data-Driven Public Sentiment Analysis of Advanced Air Mobility cs.CL · 2026-06-18 · unverdicted · none · ref 52
Applies standard sentiment classifiers and topic modeling to a large AAM discussion corpus, identifies six clusters of public concern, and lists strategies to address them.
Learning Entropy and Spatial Adaptation Dynamics of Multilayer Perceptrons for Structural Point Extraction cs.LG · 2026-06-08 · unverdicted · none · ref 4
Spatial Learning Entropy Maps derived from MLP weight adaptations during spatial pixel prediction tasks highlight image points with high learning impact.
Business World Model cs.AI · 2026-06-08 · unverdicted · none · ref 5
This paper introduces the Business World Model, a conceptual architecture that encodes business states, dynamics, and actions using semantic representations to support autonomous planning.
Learning to model pediatric asthma exacerbation from multiple risk factors: a case study in coastal Virginia cs.LG · 2026-06-04 · unverdicted · none · ref 34
A case study develops a sparse dictionary learning approach to model pediatric asthma exacerbations from multiple risk factors and reports consensus on relative risks across statistical and machine learning models.
Machine-Learning-Enhanced Non-Invasive Testing for MASLD Fibrosis: Shallow-Deep Neural Networks Versus FIB-4, Tabular Foundation Models, and Large Language Models cs.LG · 2026-05-19 · unverdicted · none · ref 21
A 354-parameter shallow-deep neural network using age, AST, ALT, platelets and FIB-4 achieved external ROC-AUCs of 0.77 and 0.67 for advanced MASLD fibrosis, slightly above FIB-4's 0.75 and 0.60 on Malaysian and Indian cohorts.
The New Associationism: Lessons from Deep Learning cs.AI · 2026-05-19 · unverdicted · none · ref 132
Supervised learning across AI systems vindicates a uniform error-driven associationism for cognition, though operating inside advanced computational structures beyond classical associationist models.
Joint sparse coding and temporal dynamics support context reconfiguration q-bio.NC · 2026-05-11 · unverdicted · none · ref 8
Joint sparse coding and temporal dynamics in mPFC and computational networks reduce cross-context interference and enhance separability, enabling better retention in lifelong learning without extra heuristics.
Single-Cycle Multidirectional EOG Classification Faster than Human Reaction Time for Wearable Human-Computer Interactions eess.SP · 2026-04-27 · unverdicted · none · ref 23
Cascaded neural networks classify 10 eye-movement classes from single-cycle EOG signals at 99% accuracy with sub-83 ms latency below human reaction time.
Using Deep Learning Models Pretrained by Self-Supervised Learning for Protein Localization cs.CV · 2026-04-13 · unverdicted · none · ref 21
DINO-based ViT models pretrained on HPA FOV achieve macro F1 of 0.822 zero-shot and 0.860 after fine-tuning for protein localization on OpenCell, demonstrating effective transfer from SSL pretraining.
The ZTF-ULTRASAT experiment: Characterizing the non-transients in ULTRASAT's high cadence survey astro-ph.SR · 2026-04-08 · unverdicted · none · ref 45
ZTF high-cadence data shows RR Lyrae stars and flaring sources can mimic UV transients, with pre-existing ML catalogs offering a concrete mitigation approach.
Machine Learning-Based Cluster Classification to Suppress Background in a Prototype RPC Detector physics.ins-det · 2026-03-30 · unverdicted · none · ref 10
Machine learning classifiers using fifteen cluster-level descriptors from time and ADC distributions effectively separate signal from background hits in prototype RPC detectors.
Perception Gaps in Risk, Benefit, and Value Between Experts and Public Challenge Socially Accepted AI cs.CY · 2024-12-02 · unverdicted · none · ref 50
Experts rate AI scenarios as more likely, less risky, more beneficial, and more valuable than the public, applying different weightings to risk versus benefit.
Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis cs.CV · 2024-07-21 · unverdicted · none · ref 11
A framework segments panoramic video into sub-images for detection, modifies multi-object tracking for boundary continuity, and applies it to vehicle overtaking detection in real cycling videos, reporting gains in precision and an F-score of 0.82.
The Mathematics of AI Winters: The mathematical Taxonomy of Paradigm Fragility in AI Winter cs.LG · 2026-06-10 · unverdicted · none · ref 19
Established mathematical bottlenecks in representation, optimization, complexity, and high-dimensional learning aligned with the central disappointments of early AI research periods.
A Proof-of-Concept Simulation-Driven Digital Twin Framework for Decision-Aware Diabetes Modeling cs.LG · 2026-05-11 · unverdicted · none · ref 30
A simulation-driven digital twin framework is shown to generate interpretable diabetes trajectories for decision-aware analysis by combining benchmark data with controlled synthetic scenarios.
A Specialized Importance-Aware Quantum Convolutional Neural Network with Ring-Topology (IA-QCNN) for MGMT Promoter Methylation Prediction in Glioblastoma quant-ph · 2026-04-24 · unverdicted · none · ref 41
IA-QCNN applies quantum principles via ring-topology convolution and importance weighting to achieve claimed high-accuracy MGMT methylation prediction from MRI with fewer parameters and noise robustness than classical models.
Supplementary Materials to Graph Convolutional Branch and Bound cs.LG · 2024-06-05 · unverdicted · none · ref 5
Supplementary results on 1-tree relaxation performance inside a GCN-augmented branch-and-bound solver for TSP.
Statistical Properties of Training & Generalization stat.ML · 2026-06-18 · unverdicted · none · ref 201
Neural scaling laws in deep learning interact with physics constraints and inductive biases beyond classical statistics.
Software Platform for Hybrid Pseudo-Random Sequence Generation and Predictability Analysis Based on LFSR and Mersenne Twister quant-ph · 2026-05-29 · unverdicted · none · ref 57
Software platform for hybrid LFSR-MT PRNG generation and ML-based predictability analysis, reporting inherent limitations in classical generators versus quantum randomness.
MiniGPT: Rebuilding GPT from First Principles cs.CL · 2026-05-17 · conditional · none · ref 32
MiniGPT is a self-contained PyTorch implementation of standard GPT autoregressive modeling that reaches 1.478 validation loss on Tiny Shakespeare with a 10.77M-parameter model and produces recognizable Shakespeare-style text.
AI-Powered Surrogate Modelling for Multiscale Combustion: A Critical Review and Opportunities physics.chem-ph · 2026-04-28 · unverdicted · none · ref 37
A critical review of AI surrogate models for multiscale combustion that compares supervised, unsupervised, and physics-guided methods, identifies transferability and consistency challenges, and outlines future opportunities.
Enhancing Laser Surface Texturing through Advanced Machine Learning Techniques cond-mat.mtrl-sci · 2026-04-14 · unverdicted · none · ref 10
Neural networks and random forests predict surface roughness from laser parameters and material data with high accuracy, speeding up optimization and reducing experimental effort.
Deep Learning for Sequential Decision Making under Uncertainty: Foundations, Frameworks, and Frontiers math.OC · 2026-04-13 · unverdicted · none · ref 82
A tutorial framing deep learning as a complement to optimization for sequential decision-making under uncertainty, with applications in supply chains, healthcare, and energy.
A Survey on Deep Learning Architectures for Point Cloud Classification and Segmentation cs.CV · 2026-05-16 · unverdicted · none · ref 48 · 2 links
A survey that categorizes deep learning models for point cloud tasks by backbone architecture, evaluates benchmark performance, and outlines challenges and future research directions.
Deep Learning in the Automotive Industry: Recent Advances and Application Examples cs.LG · 2019-06-20 · unverdicted · none · ref 20
An overview of deep learning applications and challenges in the automotive industry, covering ADAS, automated driving, virtual sensing, and data-driven development.

Deep learning

hub tools

citation-role summary

citation-polarity summary

authors

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer