hub Tool reference

Statist.] 10.1214/aos/1176344136 , 6, 461

Bradley Efron · 1979 · arXiv aos/1176344

Tool reference. 71% of classified Pith citations use this work as a method, library, or software dependency, not as a substantive claim.

61 Pith papers citing it

Method reference 71% of classified citations

read on arXiv browse 61 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

method 5 background 2

citation-polarity summary

use method 5 background 2

representative citing papers

FLOATBench: A Dataset and Benchmark for Floating Offshore Wind Turbine Tower Fatigue

cs.AI · 2026-05-25 · unverdicted · novelty 8.0

FLOATBench is a tabular benchmark dataset with 582,120 fatigue labels from 19,404 OpenFAST simulations of three 22 MW FOWT towers, featuring alpha-shape regime partitioning and three evaluation protocols for surrogate models.

Improving exoplanet mass characterisation with Bayesian model selection using the Learned Harmonic Mean Estimator

astro-ph.EP · 2026-06-25 · unverdicted · novelty 7.0

First use of the learned harmonic mean estimator for Bayesian model selection across circular/eccentric, white-noise/GP, and trend variants in radial velocity exoplanet analyses.

Hyper-Nuclei $^4_{\Lambda}\hbox{He}$ Production in $\sqrt{s_{\rm{NN}}}$ = 3 GeV Au+Au collisions at RHIC

nucl-ex · 2026-06-21 · unverdicted · novelty 7.0

First measurement of ^4_ΛHe yields in 3 GeV Au+Au collisions shows consistency with ^4_ΛH yields and JAM coalescence model while thermal model overpredicts absolute yields.

Fast Computation of Free-Support Wasserstein Medians

stat.CO · 2026-06-17 · unverdicted · novelty 7.0

Direct fixed-weight solver for free-support Wasserstein medians relocates atoms using OT barycentric projections and inverse-distance weights, achieving monotone descent on smoothed objectives with fewer subproblems than nested Weiszfeld baselines.

Nonthermal line broadening at solar flare footpoints is primarily field-aligned

astro-ph.SR · 2026-06-04 · unverdicted · novelty 7.0

Nonthermal line broadening at solar flare footpoints is primarily field-aligned, demonstrated by systematic decrease in line widths from disk center to limb across 4,593 Hinode/EIS spectra from 407 flares.

EML-CD: Causal Mechanism Recovery via EML Symbolic Trees in Structure Learning

stat.ML · 2026-06-04 · unverdicted · novelty 7.0

EML-CD recovers causal DAG structure and closed-form mechanisms via gated EML trees, matching PC/GES SHD on Sachs data while recovering 10 of 11 function families in bivariate tests and outperforming SINDy on mechanism f-MSE.

The Regularizing Power of Language-Training Deepfake Detectors

cs.CV · 2026-05-29 · unverdicted · novelty 7.0

A dual-encoder deepfake detector pairs a frozen specialist with a LoRA-tuned MLLM, trained first via binary alignment then via RL to reward explain-then-classify behavior, yielding improved cross-dataset performance and interpretability.

ProactBench: Beyond What The User Asked For

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

ProactBench measures LLM conversational proactivity in three phases using 198 multi-agent dialogues and finds recovery behavior hard to predict from existing benchmarks.

The finite-shot help-harm boundary of zero-noise extrapolation

quant-ph · 2026-05-07 · unverdicted · novelty 7.0

Zero-noise extrapolation has a finite-shot help-harm boundary below which it increases local mean-squared error due to variance penalties outweighing bias reduction.

JudgeSense: A Benchmark for Prompt Sensitivity in LLM-as-a-Judge Systems

cs.CL · 2026-04-26 · unverdicted · novelty 7.0

JudgeSense benchmark shows LLM judge consistency does not reliably improve with model scale, with coherence most sensitive to prompt changes and factuality more stable.

Causal Process Models: Reframing Dynamic Causal Graph Discovery as a Reinforcement Learning Problem

cs.LG · 2025-07-18 · unverdicted · novelty 7.0

Causal Process Models reframe dynamic causal graph discovery as multi-agent reinforcement learning to build sparse time-varying graphs only at active interactions, outperforming dense baselines on physical prediction.

Variational Sequential Optimal Experimental Design using Reinforcement Learning

stat.ML · 2023-06-17 · unverdicted · novelty 7.0

vsOED uses a variational one-point reward and RL policy optimization to provide a lower bound on expected information gain for sequential experimental design, supporting nuisance parameters, implicit likelihoods, and multiple design goals.

Solve for the Hyperparameter, Skip the Search: Kolmogorov-Optimal Scaling Laws for Spline Regression

cs.LG · 2026-06-22 · unverdicted · novelty 6.0

Kolmogorov n-width theory plus PRESS statistics yield closed-form optimal spline resolution; KORE estimates bias/noise scales from two pilots and matches CV performance with far fewer fits.

Computational references are not experiments: pre-registered validation of machine-learned sodium-cathode voltages

cond-mat.mtrl-sci · 2026-06-19 · unverdicted · novelty 6.0 · 2 refs

Pre-registered validation of an ML Na-cathode voltage screen yields 0.67 V MAE against experiment, with Materials Project PBE+U references 0.54 V low and dominating the error.

A data-driven method for measuring corner-clipping probabilities in segmented particle detectors

astro-ph.IM · 2026-06-09 · unverdicted · novelty 6.0

A timing-based data-driven method to measure single-particle corner-clipping probabilities in segmented detectors, validated on Pierre Auger Underground Muon Detector simulations and parameterized by an analytical model.

Two-Sample Homogeneity Test via Entropic Optimal Transport

stat.ME · 2026-06-09 · unverdicted · novelty 6.0

Proposes and analyzes a homogeneity test using squared L2 distance of empirical EOT maps to uniform-on-ball reference, with FCLT, Gaussian quadratic null limit, consistency, local power, and weighted multiplier bootstrap.

TypedCSIP: Typed Counterfactual Pretraining for Chinese Legislative Conflict Classification

cs.CL · 2026-05-25 · unverdicted · novelty 6.0

TypedCSIP applies typed counterfactual selective intervention pretraining on expert revisions to lift macro-F1 by 0.9-1.3 pp on the LCR-CN Chinese legislative conflict classification benchmark under a pre-registered multi-seed test.

Expand More, Shrink Less: Shaping Effective-Rank Dynamics for Dense Scaling in Recommendation

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

RankElastor mitigates embedding collapse via spectrum-robust token mixing and GLU-based P-FFNs, yielding better performance and scaling on industrial recommendation datasets.

Absorbing Many-Body Correlations into Core-Optimized Orbitals

quant-ph · 2026-05-21 · unverdicted · novelty 6.0

COO co-optimizes orbitals with TrimCI to absorb many-body correlations into the basis, cutting determinant count by orders of magnitude for iron-sulfur clusters versus localized bases or DMRG.

Reliable model selection in the presence of parameter non-identifiability

stat.ME · 2026-05-19 · unverdicted · novelty 6.0

Proposes adaptive multiple importance sampling for robust Bayesian model evidence estimation under parameter non-identifiability, shown to outperform deterministic methods on ecological case studies while being cheaper than MCMC.

Bayesian Modeling and Prediction of Generalized Contact Matrices

stat.ME · 2026-05-07 · unverdicted · novelty 6.0

A Bayesian model for multi-feature contact matrices that uses tensor structures and contingency table theory to satisfy structural constraints and impute missing contact features, validated on simulations and US/German survey data.

Sequential Bayesian Monitoring for Recoverable and Drifting Processes

stat.CO · 2026-05-05 · unverdicted · novelty 6.0

Bayesian procedures are derived to compute the posterior probability that a recoverable process is currently in control or that a drifting latent parameter lies in an acceptable region.

A Semi-Supervised Kernel Two-Sample Test

stat.ML · 2026-05-03 · unverdicted · novelty 6.0

A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.

The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability

cs.LG · 2026-04-20 · unverdicted · novelty 6.0

Task-aligned supervised geometric stability predicts linear steerability with high accuracy while unsupervised stability detects representational drift earlier and with lower false alarms than CKA or Procrustes.

citing papers explorer

Showing 11 of 61 citing papers.

Power Studies For Two-Sample and Goodness-of-Fit Methods For Multivariate Data stat.ME · 2026-05-12 · unverdicted · none · ref 30
No single goodness-of-fit or two-sample test reliably detects deviations across all multivariate scenarios, so the authors recommend a small combination of methods that together cover the simulated cases.
Spatially Resolved Kinematics of SLACS Lens Galaxies. II: Breaking Degeneracies with Lensing and Dynamical Models astro-ph.GA · 2026-04-14 · unverdicted · none · ref 57
Spatially resolved kinematics show SLACS lens galaxies have nearly isothermal total mass profiles (mean γ=2.04) with average mass-sheet parameter λ_int=1.01, consistent with no measurable bias from power-law assumptions in cosmography.
Latent Profiles of AI Risk Perception and Their Differential Association with Community Driving Safety Concerns: A Person-Centered Analysis cs.CY · 2026-04-06 · unverdicted · none · ref 44
Four latent profiles of AI risk perception were identified in U.S. adults, with higher AI concern generally linked to greater perceived driving-hazard severity except for AI-versus-human driving comparisons.
Environment-Aware Indoor LoRaWAN Path Loss: Parametric Regression Comparisons, Shadow Fading, and Calibrated Fade Margins cs.NI · 2025-10-05 · conditional · none · ref 48 · 2 links
Environment-conditioned parametric regression on 12-month indoor LoRaWAN data reduces cross-validated RMSE from 8.23 dB to 7.38 dB and lowers the fade margin needed for 99% reliability from ~28 dB to 25.73 dB.
ROC Analysis for Evaluating Translation Quality Estimation Systems cs.CL · 2026-05-23 · unverdicted · none · ref 6
ROC analysis is proposed for evaluating translation quality estimation systems, claimed to match existing methods while providing actionable business insights.
Constraining Dark Energy Dynamics in Curved Spacetime with Current Observations physics.gen-ph · 2026-05-08 · unverdicted · none · ref 18
Observational constraints on a dark energy EoS parametrization in curved spacetime yield α ≈ 0.35 (0.56) and Ω_k0 that changes sign with ANN data reconstruction.
AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration cs.AI · 2026-05-19 · unreviewed · ref 6
Evaluating Deep Research Agents on Expert Consulting Work: A Benchmark with Verifiers, Rubrics, and Cognitive Traps cs.AI · 2026-05-17 · unreviewed · ref 6
How to quantify direct correlations between variables stat.ME · 2026-04-20 · unreviewed · ref 42
Nested Sampling for ARIMA Model Selection in Astronomical Time-Series Analysis astro-ph.IM · 2025-12-01 · unreviewed · ref 44
How Many Human Survey Respondents is a Large Language Model Worth? An Uncertainty Quantification Perspective stat.ME · 2025-02-25 · unreviewed · ref 22

Statist.] 10.1214/aos/1176344136 , 6, 461

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer