First use of the learned harmonic mean estimator for Bayesian model selection across circular/eccentric, white-noise/GP, and trend variants in radial velocity exoplanet analyses.
hub Tool reference
Statist.] 10.1214/aos/1176344136 , 6, 461
Tool reference. 83% of classified Pith citations use this work as a method, library, or software dependency, not as a substantive claim.
hub tools
citation-role summary
citation-polarity summary
representative citing papers
First measurement of ^4_ΛHe yields in 3 GeV Au+Au collisions shows consistency with ^4_ΛH yields and JAM coalescence model while thermal model overpredicts absolute yields.
Direct fixed-weight solver for free-support Wasserstein medians relocates atoms using OT barycentric projections and inverse-distance weights, achieving monotone descent on smoothed objectives with fewer subproblems than nested Weiszfeld baselines.
Nonthermal line broadening at solar flare footpoints is primarily field-aligned, demonstrated by systematic decrease in line widths from disk center to limb across 4,593 Hinode/EIS spectra from 407 flares.
EML-CD recovers causal DAG structure and closed-form mechanisms via gated EML trees, matching PC/GES SHD on Sachs data while recovering 10 of 11 function families in bivariate tests and outperforming SINDy on mechanism f-MSE.
A dual-encoder deepfake detector pairs a frozen specialist with a LoRA-tuned MLLM, trained first via binary alignment then via RL to reward explain-then-classify behavior, yielding improved cross-dataset performance and interpretability.
ProactBench measures LLM conversational proactivity in three phases using 198 multi-agent dialogues and finds recovery behavior hard to predict from existing benchmarks.
Zero-noise extrapolation has a finite-shot help-harm boundary below which it increases local mean-squared error due to variance penalties outweighing bias reduction.
JudgeSense benchmark shows LLM judge consistency does not reliably improve with model scale, with coherence most sensitive to prompt changes and factuality more stable.
Causal Process Models reframe dynamic causal graph discovery as multi-agent reinforcement learning to build sparse time-varying graphs only at active interactions, outperforming dense baselines on physical prediction.
vsOED uses a variational one-point reward and RL policy optimization to provide a lower bound on expected information gain for sequential experimental design, supporting nuisance parameters, implicit likelihoods, and multiple design goals.
Kolmogorov n-width theory plus PRESS statistics yield closed-form optimal spline resolution; KORE estimates bias/noise scales from two pilots and matches CV performance with far fewer fits.
Pre-registered validation of an ML Na-cathode voltage screen yields 0.67 V MAE against experiment, with Materials Project PBE+U references 0.54 V low and voltage-dependent residuals preventing calibration, leading to retirement of the screen.
A timing-based data-driven method to measure single-particle corner-clipping probabilities in segmented detectors, validated on Pierre Auger Underground Muon Detector simulations and parameterized by an analytical model.
Proposes and analyzes a homogeneity test using squared L2 distance of empirical EOT maps to uniform-on-ball reference, with FCLT, Gaussian quadratic null limit, consistency, local power, and weighted multiplier bootstrap.
RankElastor mitigates embedding collapse via spectrum-robust token mixing and GLU-based P-FFNs, yielding better performance and scaling on industrial recommendation datasets.
COO co-optimizes orbitals with TrimCI to absorb many-body correlations into the basis, cutting determinant count by orders of magnitude for iron-sulfur clusters versus localized bases or DMRG.
Proposes adaptive multiple importance sampling for robust Bayesian model evidence estimation under parameter non-identifiability, shown to outperform deterministic methods on ecological case studies while being cheaper than MCMC.
A Bayesian model for multi-feature contact matrices that uses tensor structures and contingency table theory to satisfy structural constraints and impute missing contact features, validated on simulations and US/German survey data.
Bayesian procedures are derived to compute the posterior probability that a recoverable process is currently in control or that a drifting latent parameter lies in an acceptable region.
A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.
Task-aligned supervised geometric stability predicts linear steerability with high accuracy while unsupervised stability detects representational drift earlier and with lower false alarms than CKA or Procrustes.
42% of significant turn-level associations in LLM conversation analysis are spurious due to unaccounted autocorrelation, with a validated two-stage correction framework improving replication.
Bio-PINNs with a near-to-far curriculum and deformation-uncertainty proxy recover cell-induced densified phases and tether morphologies more reliably than standard adaptive PINN baselines in single-cell and multicellular settings.
citing papers explorer
-
Improving exoplanet mass characterisation with Bayesian model selection using the Learned Harmonic Mean Estimator
First use of the learned harmonic mean estimator for Bayesian model selection across circular/eccentric, white-noise/GP, and trend variants in radial velocity exoplanet analyses.
-
Hyper-Nuclei $^4_{\Lambda}\hbox{He}$ Production in $\sqrt{s_{\rm{NN}}}$ = 3 GeV Au+Au collisions at RHIC
First measurement of ^4_ΛHe yields in 3 GeV Au+Au collisions shows consistency with ^4_ΛH yields and JAM coalescence model while thermal model overpredicts absolute yields.
-
Fast Computation of Free-Support Wasserstein Medians
Direct fixed-weight solver for free-support Wasserstein medians relocates atoms using OT barycentric projections and inverse-distance weights, achieving monotone descent on smoothed objectives with fewer subproblems than nested Weiszfeld baselines.
-
Nonthermal line broadening at solar flare footpoints is primarily field-aligned
Nonthermal line broadening at solar flare footpoints is primarily field-aligned, demonstrated by systematic decrease in line widths from disk center to limb across 4,593 Hinode/EIS spectra from 407 flares.
-
EML-CD: Causal Mechanism Recovery via EML Symbolic Trees in Structure Learning
EML-CD recovers causal DAG structure and closed-form mechanisms via gated EML trees, matching PC/GES SHD on Sachs data while recovering 10 of 11 function families in bivariate tests and outperforming SINDy on mechanism f-MSE.
-
The Regularizing Power of Language-Training Deepfake Detectors
A dual-encoder deepfake detector pairs a frozen specialist with a LoRA-tuned MLLM, trained first via binary alignment then via RL to reward explain-then-classify behavior, yielding improved cross-dataset performance and interpretability.
-
ProactBench: Beyond What The User Asked For
ProactBench measures LLM conversational proactivity in three phases using 198 multi-agent dialogues and finds recovery behavior hard to predict from existing benchmarks.
-
The finite-shot help-harm boundary of zero-noise extrapolation
Zero-noise extrapolation has a finite-shot help-harm boundary below which it increases local mean-squared error due to variance penalties outweighing bias reduction.
-
JudgeSense: A Benchmark for Prompt Sensitivity in LLM-as-a-Judge Systems
JudgeSense benchmark shows LLM judge consistency does not reliably improve with model scale, with coherence most sensitive to prompt changes and factuality more stable.
-
Causal Process Models: Reframing Dynamic Causal Graph Discovery as a Reinforcement Learning Problem
Causal Process Models reframe dynamic causal graph discovery as multi-agent reinforcement learning to build sparse time-varying graphs only at active interactions, outperforming dense baselines on physical prediction.
-
Variational Sequential Optimal Experimental Design using Reinforcement Learning
vsOED uses a variational one-point reward and RL policy optimization to provide a lower bound on expected information gain for sequential experimental design, supporting nuisance parameters, implicit likelihoods, and multiple design goals.
-
Solve for the Hyperparameter, Skip the Search: Kolmogorov-Optimal Scaling Laws for Spline Regression
Kolmogorov n-width theory plus PRESS statistics yield closed-form optimal spline resolution; KORE estimates bias/noise scales from two pilots and matches CV performance with far fewer fits.
-
Computational references are not experiments: pre-registered validation of machine-learned sodium-cathode voltages
Pre-registered validation of an ML Na-cathode voltage screen yields 0.67 V MAE against experiment, with Materials Project PBE+U references 0.54 V low and voltage-dependent residuals preventing calibration, leading to retirement of the screen.
-
A data-driven method for measuring corner-clipping probabilities in segmented particle detectors
A timing-based data-driven method to measure single-particle corner-clipping probabilities in segmented detectors, validated on Pierre Auger Underground Muon Detector simulations and parameterized by an analytical model.
-
Two-Sample Homogeneity Test via Entropic Optimal Transport
Proposes and analyzes a homogeneity test using squared L2 distance of empirical EOT maps to uniform-on-ball reference, with FCLT, Gaussian quadratic null limit, consistency, local power, and weighted multiplier bootstrap.
-
Expand More, Shrink Less: Shaping Effective-Rank Dynamics for Dense Scaling in Recommendation
RankElastor mitigates embedding collapse via spectrum-robust token mixing and GLU-based P-FFNs, yielding better performance and scaling on industrial recommendation datasets.
-
Absorbing Many-Body Correlations into Core-Optimized Orbitals
COO co-optimizes orbitals with TrimCI to absorb many-body correlations into the basis, cutting determinant count by orders of magnitude for iron-sulfur clusters versus localized bases or DMRG.
-
Reliable model selection in the presence of parameter non-identifiability
Proposes adaptive multiple importance sampling for robust Bayesian model evidence estimation under parameter non-identifiability, shown to outperform deterministic methods on ecological case studies while being cheaper than MCMC.
-
Bayesian Modeling and Prediction of Generalized Contact Matrices
A Bayesian model for multi-feature contact matrices that uses tensor structures and contingency table theory to satisfy structural constraints and impute missing contact features, validated on simulations and US/German survey data.
-
Sequential Bayesian Monitoring for Recoverable and Drifting Processes
Bayesian procedures are derived to compute the posterior probability that a recoverable process is currently in control or that a drifting latent parameter lies in an acceptable region.
-
A Semi-Supervised Kernel Two-Sample Test
A semi-supervised kernel two-sample test integrates unlabeled covariate data to achieve asymptotic normality under the null, higher power than standard kernel tests, and consistency against fixed and local alternatives.
-
The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability
Task-aligned supervised geometric stability predicts linear steerability with high accuracy while unsupervised stability detects representational drift earlier and with lower false alarms than CKA or Procrustes.
-
The Autocorrelation Blind Spot: Why 42% of Turn-Level Findings in LLM Conversation Analysis May Be Spurious
42% of significant turn-level associations in LLM conversation analysis are spurious due to unaccounted autocorrelation, with a validated two-stage correction framework improving replication.
-
Cell-induced densification and tether formation in fibrous extracellular matrices with biomimetic physics-informed neural networks
Bio-PINNs with a near-to-far curriculum and deformation-uncertainty proxy recover cell-induced densified phases and tether morphologies more reliably than standard adaptive PINN baselines in single-cell and multicellular settings.
-
A decay-adjusted spatio-temporal model to account for the impact of mass drug administration on neglected tropical disease prevalence
A new decay-adjusted spatio-temporal model improves estimation of neglected tropical disease prevalence by explicitly accounting for the waning impact of mass drug administration in sparse survey data.
-
INLA-RF: A Hybrid Modeling Strategy for Spatio-Temporal Environmental Data
A hybrid INLA-RF framework integrates Bayesian spatio-temporal modeling with random forests through two iterative algorithms to improve predictions and uncertainty quantification for environmental data.
-
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
RAPTOR introduces a tree-organized retrieval method using recursive abstractive summaries, achieving a 20% absolute accuracy improvement on the QuALITY benchmark when paired with GPT-4.
-
Stability selection enables robust learning of partial differential equations from limited noisy data
PDE-STRIDE applies stability-based model selection to sparse regression for robust, parameter-free recovery of PDEs from noisy data.
-
Commensal discovery of four Fast Radio Bursts during Parkes Pulsar Timing Array observations
Four new FRBs discovered commensally during Parkes PTA pulsar observations, including one with record S/N and unusual spectrum; all highly polarized.
-
dynesty: A Dynamic Nested Sampling Package for Estimating Bayesian Posteriors and Evidences
dynesty is an open-source Python package for dynamic nested sampling that improves efficiency in Bayesian posterior and evidence estimation compared to MCMC on certain problems.
-
A Spitzer Space Telescope Exploration Science Program to Search for Y Dwarf Variability
Spitzer mid-IR monitoring of all known Y dwarfs yields variability fractions of 35-75% that, when combined with L/T dwarf data, weakly support condensate cloud variations as the driver of brown dwarf variability.
-
The $M_{\rm BH}$$-$$R_{\rm b}$ relation and the high-mass end of the $M_{\rm BH}$$-$$\sigma$ relation
Updated M_BH-R_b relation for core-Sérsic galaxies has slope 1.16 and 0.28 dex scatter; large-core galaxies drive a high-mass upturn in M_BH-σ attributed to successive dry mergers.
-
A GPH-Filtered Hannan--Rissanen Information Criterion for ARFIMA Order Selection
A consistent two-stage GPH-filtered Hannan-Rissanen generalized information criterion for selecting finite AR and MA orders in ARFIMA models with growing candidate sets.
-
Density Evolution: A Multiscale View of Density Estimation
A review reframing density estimation as 'density evolution' across scales, linking kernel smoothing to heat flow, mixtures to compression, and topology to level sets, while stating three structural results on modes, Gaussian semigroups, and log-concavity.
-
A tool to determine the degrees of freedom in tree-structured varying coefficient models
A formula approximating degrees of freedom for tree-structured varying coefficient models is proposed to improve BIC model selection over naive parameter counting.
-
Unsupervised Domain Shift Detection with Interpretable Subspace Attribution
An unsupervised method detects domain shifts via localized density anomaly search in feature space, attributes the shift to a minimal subspace, and extracts balanced subsets from two unlabeled datasets.
-
Fast and principled equation discovery from chaos to climate
Bayesian-ARGOS is a hybrid frequentist-Bayesian method that discovers equations from limited noisy observations more efficiently than SINDy or bootstrap-ARGOS while adding uncertainty quantification.
-
Multi-fidelity Gaussian process regression for noisy outputs and non-nested experimental designs: a comparison between the recursive and non-recursive formulations
Recursive multi-fidelity GP regression with EM optimization trains faster than the coupled non-recursive Kennedy-O'Hagan approach on noisy non-nested data while delivering comparable predictions and uncertainty estimates.
-
QWHA: Quantization-Aware Walsh-Hadamard Adaptation for Parameter-Efficient Fine-Tuning on Large Language Models
QWHA proposes Walsh-Hadamard Transform adapters with adaptive initialization for quantization-aware PEFT, claiming better low-bit accuracy and faster training than low-rank or other FT-based baselines.
-
Understanding Task Representations in Neural Networks via Bayesian Ablation
A Bayesian ablation framework combined with information-theoretic metrics is introduced to analyze causal roles, distributedness, manifold complexity, and polysemanticity of task representations in neural networks.
-
Adaptive Pricing in Insurance: Generalized Linear Models and Gaussian Process Regression Approaches
Adaptive GLM with MQLE and GP regression with UCB for dynamic insurance pricing, showing parameter convergence and regret analysis under delayed claims.
-
Quantity-Dependent Bulk-to-Wall Observability of Surface Loading in Rarefied Hypersonic Flow over Triangular Protrusions
DSMC-based neighborhood analysis of rarefied hypersonic flow over triangular protrusions shows quantity-dependent bulk-to-wall information requirements, with no universal length scale for surface loading.
-
LEGGOS II: A Strong Lens Model and Source-Plane Projection of the Clumpy Star-Forming Galaxy SGASJ111020.0+645950.8 at z=2.48
An updated strong lensing model of SDSSJ1110+6459 using JWST-resolved clumps yields M(<250 kpc) = 1.21^{+0.09}_{-0.04} imes 10^{14} M_/sun, heta_E = 10.8^{+0.3}_{-0.4} arcsec, and \mu_tot = 24.2^{+3.4}_{-1.2} with 2-8 imes better magnification precision than prior models.
-
GJ 3929 b as the First Complete Rocky Worlds DDT Data Set
Complete four-observation JWST dataset for GJ 3929 b yields 118±22 ppm eclipse depth, consistent with bare rock while allowing thin atmospheres and ruling out only thick CO2 cases at >3σ.
-
Validation-Gated Multi-Agent Governance for Online Adaptation of Thermal-Hydraulic Surrogate Models under Operating-Regime Shift
A validation-gated multi-agent framework enables online adaptation of thermal-hydraulic surrogates and reduces forecast error by 19% under regime shifts on experimental loop data.
-
Power Studies For Two-Sample and Goodness-of-Fit Methods For Multivariate Data
No single goodness-of-fit or two-sample test reliably detects deviations across all multivariate scenarios, so the authors recommend a small combination of methods that together cover the simulated cases.
-
Spatially Resolved Kinematics of SLACS Lens Galaxies. II: Breaking Degeneracies with Lensing and Dynamical Models
Spatially resolved kinematics show SLACS lens galaxies have nearly isothermal total mass profiles (mean γ=2.04) with average mass-sheet parameter λ_int=1.01, consistent with no measurable bias from power-law assumptions in cosmography.
-
Latent Profiles of AI Risk Perception and Their Differential Association with Community Driving Safety Concerns: A Person-Centered Analysis
Four latent profiles of AI risk perception were identified in U.S. adults, with higher AI concern generally linked to greater perceived driving-hazard severity except for AI-versus-human driving comparisons.
-
Environment-Aware Indoor LoRaWAN Path Loss: Parametric Regression Comparisons, Shadow Fading, and Calibrated Fade Margins
Environment-conditioned parametric regression on 12-month indoor LoRaWAN data reduces cross-validated RMSE from 8.23 dB to 7.38 dB and lowers the fade margin needed for 99% reliability from ~28 dB to 25.73 dB.
-
Constraining Dark Energy Dynamics in Curved Spacetime with Current Observations
Observational constraints on a dark energy EoS parametrization in curved spacetime yield α ≈ 0.35 (0.56) and Ω_k0 that changes sign with ANN data reconstruction.