An exact algebraic identity plus low-rank SVD and Haar-measure null-space approximation reduce per-point mean curvature cost from O(m^4) to O(k^2 m + k m p^2) with 50-300x speedups and negligible accuracy loss.
hub
van Rijn, Bernd Bischl, and Luis Torgo
19 Pith papers cite this work. Polarity classification is still indexing.
hub tools
citation-role summary
citation-polarity summary
roles
background 2polarities
background 2representative citing papers
ScoreStop introduces a functional score test for early stopping in gradient boosting, testing the null that the current predictor minimizes population risk with a scale-invariant statistic of known asymptotic distribution.
Presents pseudo-polynomial DP algorithm O(W k n²) for weighted kNN Banzhaf valuation and O(n k²) for unweighted, plus Monte Carlo estimators, after proving #P-hardness.
PFN-TS converts PFN posterior predictives into mean-reward samples for Thompson sampling using a subsampled predictive CLT, with consistency proofs, regret bounds, and strong empirical performance on synthetic and real bandit benchmarks.
FLOWGEM generates complete data under non-monotone MAR missingness by discretizing Wasserstein gradient flows with a local linear density-ratio estimator to minimize expected KL divergence over missingness patterns.
Defines saturation index S(K) = erank(Σ̂_W^(K))/K that identifies when linear discriminant stabilizes in binary few-shot classification, with empirical phase diagram and stopping-rule AUC of 0.752 on 17 tasks.
Bayesian PCFG generates synthetic physics-like regression datasets matching eight structural features of the Feynman corpus and enabling equivalent hyperparameter tuning performance to real data.
Path-based adaptive weighting of random forest trees via decision path patterns delivers statistically significant accuracy gains on 36 binary classification benchmarks with minimal class-recall regression.
Introduces gradient-discrepancy acquisition criterion derived from Luo et al. (2022) generalization bound for active learning.
Quantum kernel methods show no statistically significant edge over strong classical baselines on tabular classification tasks, with current feature maps failing to match the spectral properties of the best classical kernel.
TFMPathy applies tabular foundation models to summary statistics of visual features for subject-generalizable empathy detection under strong privacy constraints, with improved cross-subject performance on a public benchmark.
Derives optimal low-rank subspace for Laplace approx in BNNs, provides scalable outperforming version, and new comparison metric.
Soft Learning optimally combines heterogeneous ML specialists via cross-validated non-negative least squares, achieving top performance on 70% of 37 datasets with formal guarantees and 72-435x CPU speedups over deep networks.
The paper defines algorithmic contestability as identifying evidence to overturn potentially incorrect decisions and identifies three types of such evidence that make decisions normatively indefensible under the decision maker's standards.
TabGRAA applies group-relative advantage alignment in an iterative reward-guided post-training loop to improve tabular language model generators on fidelity, utility, and privacy trade-offs across five benchmarks.
Large-scale neutral benchmark of survival models on low-dimensional right-censored data finds Cox PH performs comparably to more complex methods across discrimination, calibration, and predictive metrics.
A CBR system based on similarity of local explanations provides visualizations that fraud analysts at a Dutch bank found useful and easy to use for processing ML-generated fraud alerts.
Human-grounded evaluation finds no significant performance improvement from adding SHAP explanations to model confidence scores in alert processing.
Conditional inference forests rank competitively as top-k feature selectors in classification and regression benchmarks, with runtime factors identified but limited impact on scores.
citing papers explorer
-
ScoreStop: Gradient-based early stopping using functional score tests
ScoreStop introduces a functional score test for early stopping in gradient boosting, testing the null that the current predictor minimizes population risk with a scale-invariant statistic of known asymptotic distribution.
-
PFN-TS: Thompson Sampling for Contextual Bandits via Prior-Data Fitted Networks
PFN-TS converts PFN posterior predictives into mean-reward samples for Thompson sampling using a subsampled predictive CLT, with consistency proofs, regret bounds, and strong empirical performance on synthetic and real bandit benchmarks.
-
Generative Modeling under Non-Monotone MAR Missingness via Approximate Wasserstein Gradient Flows
FLOWGEM generates complete data under non-monotone MAR missingness by discretizing Wasserstein gradient flows with a local linear density-ratio estimator to minimize expected KL divergence over missingness patterns.
-
Explainable AI Isn't Enough! Rethinking Algorithmic Contestability
The paper defines algorithmic contestability as identifying evidence to overturn potentially incorrect decisions and identifies three types of such evidence that make decisions normatively indefensible under the decision maker's standards.
-
A Large-Scale Neutral Comparison Study of Survival Models on Low-Dimensional Data
Large-scale neutral benchmark of survival models on low-dimensional right-censored data finds Cox PH performs comparably to more complex methods across discrimination, calibration, and predictive metrics.