A per-layer risk estimator for hybrid deep networks shows that replacing learned layers with known operators shrinks the bound and scales sample needs with the number of replaced parameters, validated on CT reconstruction.
Title resolution pending
9 Pith papers cite this work, alongside 203 external citations. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 1polarities
background 1representative citing papers
Finite-sample risk bounds for DQN with ReLU networks are extended to τ-mixing data, showing an extra dimensionality penalty in the convergence rate due to dependence.
A recipe translates ReLU approximations to softmax attention with target-specific economic bounds for multiplication, reciprocal computation, and min/max primitives.
A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.
SDAMI detects interactions in high-dimensional data via an Effect Footprint principle and models them using sparsity, group lasso, and dedicated deep subnetworks for improved interpretability.
Neural feature maps create expressive kernels that enable fast, scalable, and consistent exact Gaussian process inference for regression and classification.
The deep SPAR model shows concurrent floods and droughts becoming more likely in the Upper Danube by 2100 under high emissions, with changes in the dependence between catchments contributing substantially to the increase.
RecPIE jointly optimizes recommendation predictions and LLM-generated natural-language explanations via alternating training and reinforcement learning, yielding 3-4% accuracy gains and higher human preference on Google Maps POI data.
citing papers explorer
-
A Deep Risk Estimator for Known Operator Learning
A per-layer risk estimator for hybrid deep networks shows that replacing learned layers with known operators shrinks the bound and scales sample needs with the number of replaced parameters, validated on CT reconstruction.
-
Transformer Approximations from ReLUs
A recipe translates ReLU approximations to softmax attention with target-specific economic bounds for multiplication, reciprocal computation, and min/max primitives.
- Approximation Error Upper and Lower Bounds for H\"{o}lder Class with Transformers