hub Mixed citations

Bayesian Active Learning for Classification and Preference Learning

· 2011 · stat.ML · arXiv 1112.5745

Mixed citation behavior. Most common role is background (67%).

40 Pith papers citing it

Background 67% of classified citations

open full Pith review browse 40 citing papers arXiv PDF

abstract

Information theoretic active learning has been widely studied for probabilistic models. For simple regression an optimal myopic policy is easily tractable. However, for other tasks and with more complex models, such as classification with nonparametric models, the optimal solution is harder to compute. Current approaches make approximations to achieve tractability. We propose an approach that expresses information gain in terms of predictive entropies, and apply this method to the Gaussian Process Classifier (GPC). Our approach makes minimal approximations to the full information theoretic objective. Our experimental performance compares favourably to many popular active learning algorithms, and has equal or lower computational complexity. We compare well to decision theoretic approaches also, which are privy to more information and require much more computational time. Secondly, by developing further a reformulation of binary preference learning to a classification problem, we extend our algorithm to Gaussian Process preference learning.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 5 method 1

citation-polarity summary

background 4 unclear 1 use method 1

representative citing papers

The Minimax Rate of Second-Order Calibration

cs.LG · 2026-05-08 · unverdicted · novelty 8.0

The minimax rate of estimating second-order calibration error is Õ(1/√n) with a matching Ω(1/√n) lower bound, enabled by analyticity from the sech kernel and yielding the first finite-sample guarantee for second-order Platt scaling.

Variance Reduction for Stochastic Gradient Generalized Non-reversible Langevin Monte Carlo Algorithms

stat.ML · 2026-06-27 · unverdicted · novelty 7.0

Proves CLT for stochastic gradient non-reversible Langevin Monte Carlo and sufficient condition for variance reduction via anti-symmetric perturbation relative to reversible baseline.

Can In-Context Learning Support Intrinsic Curiosity?

cs.LG · 2026-06-17 · unverdicted · novelty 7.0

ICL-derived intrinsic rewards are biased in general MDPs but asymptotically match true learning progress in non-temporal settings, with supporting experiments.

SymQNet: Amortized Acquisition for Low-Latency Adaptive Hamiltonian Learning

cs.LG · 2026-06-11 · unverdicted · novelty 7.0

SymQNet uses offline RL to amortize acquisition for adaptive Hamiltonian learning, delivering 47-72x lower decision latency than online Bayesian baselines on Ising models while keeping posterior updates.

ATLAS: Active Theory Learning for Automated Science

cs.LG · 2026-06-10 · unverdicted · novelty 7.0

ATLAS uses active learning with disentangled RNN ensembles to design experiments that recover RL agent models from bandit behavior 5-10x more efficiently than random or expert baselines in simulations.

Optimal-Point Variance Reduction For Bayesian Optimization With Regret Guarantee

cs.LG · 2026-05-31 · unverdicted · novelty 7.0

OVR is a new one-step lookahead BO algorithm with Monte Carlo approximations that achieves a vanishing Bayesian expected simple regret upper bound.

Active Timepoint Selection for Learning Measure-Valued Trajectories

cs.LG · 2026-05-28 · unverdicted · novelty 7.0

Introduces an active learning method using LOT and Gaussian Processes to select optimal timepoints for inferring continuous probability paths from sparse distributional snapshots.

On the Construction and Implications of Low-Loss Valleys in LoRA-based Bayesian Inference

cs.LG · 2026-05-28 · unverdicted · novelty 7.0

Introduces LoRA-Curve parameterization to link independent LoRA optima via low-loss valleys, yielding higher predictive mutual information on reasoning and classification tasks with Qwen2.5 7B.

Active Learning with Selective Time-Step Acquisition for PDEs

cs.LG · 2025-11-22 · unverdicted · novelty 7.0

STAP reduces training data costs for PDE surrogates by selectively acquiring key time steps per trajectory instead of full simulations.

Active Learning for Gaussian Process Regression Under Self-Induced Boltzmann Weights

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

AB-SID-iVAR enables Gaussian process active learning for self-induced Boltzmann distributions by closed-form approximation of the target, with high-probability error vanishing guarantees and empirical gains on PES and drug discovery tasks.

LLM-guided Semi-Supervised Approaches for Social Media Crisis Data Classification

cs.AI · 2026-05-08 · conditional · novelty 7.0

LG-CoTrain, an LLM-guided co-training method, outperforms classical semi-supervised baselines for crisis tweet classification in low-resource settings with 5-25 labeled examples per class.

InfoChess: A Game of Adversarial Inference and a Laboratory for Quantifiable Information Control

cs.MA · 2026-04-15 · unverdicted · novelty 7.0

InfoChess proposes a symmetric adversarial game focused purely on information control and probabilistic king-location inference, with RL agents outperforming heuristic baselines and gameplay dissected via belief entropy, cross-entropy, and predictive scores.

Ask the Right Comparison:Bias-Aware Bayesian Active Top-$k$ Ranking with LLM Judges

cs.LG · 2026-07-02 · unverdicted · novelty 6.0

A bias-aware Bayesian model with judge-specific covariates and a top-k membership uncertainty acquisition rule recovers accurate top-k rankings from noisy LLM judges using fewer comparisons than naive aggregation or standard active learning.

Bayesian Adaptation Gym: A Benchmark for the Bayesian Low-Rank Adaptation of Multi-Modal Language Models

cs.LG · 2026-06-20 · unverdicted · novelty 6.0

Introduces BAG, an open-source benchmark suite with baselines, datasets, and tasks for assessing Bayesian low-rank adaptation of multi-modal language models on calibration, robustness, and decision-making under uncertainty.

On the QUEST for Uncertainty Quantification via Highest Density Regions

cs.LG · 2026-06-17 · unverdicted · novelty 6.0

QUEST measures uncertainty via the Lebesgue volume of highest-density regions of a distribution's support, evaluated at robustness parameter alpha, and claims to satisfy UQ axioms while outperforming variance and differential entropy on selective prediction tasks.

Surprise-Guided MergeSort: Budget-Efficient Human-in-the-Loop Ranking via Adaptive Comparison Scheduling

cs.LG · 2026-06-14 · unverdicted · novelty 6.0

Surprise-Guided MergeSort uses a VLM-based composite surprise scorer to prioritize human comparisons inside a MergeSort scheduler, skipping up to 535 pairs per session and raising Kendall's τ by 6-12 points over Active Elo at fixed budget across six benchmarks.

Epistemic Uncertainty Is Not the Reducible Kind

stat.ML · 2026-06-10 · unverdicted · novelty 6.0

The mutual-information measure of epistemic uncertainty is not reducible by additional data, requiring a split into aleatoric, sample-reducible epistemic, and mechanism-reducible epistemic uncertainty.

ShaplEIG: Bayesian Experimental Design for Shapley Value Estimation

stat.ML · 2026-06-01 · unverdicted · novelty 6.0

ShaplEIG adaptively selects coalitions for Shapley value estimation via expected information gain from a Gaussian process surrogate, with closed-form computation and polynomial complexity, showing improved sample efficiency in low-budget regimes.

Active Learning MPC Objective Functions from Preferences

eess.SY · 2026-05-15 · unverdicted · novelty 6.0

Active learning strategies for preference-based MPC objective learning achieve better closed-loop alignment with human preferences using fewer queries than random sampling in numerical tests.

Adaptive Prompt Elicitation for Text-to-Image Generation

cs.HC · 2026-02-04 · unverdicted · novelty 6.0

Adaptive Prompt Elicitation (APE) uses an information-theoretic framework to generate visual queries that elicit and compile user intent into better prompts for text-to-image models, showing improved alignment in benchmarks and a user study.

Epistemic Uncertainty for Test-Time Discovery

cs.LG · 2026-05-11 · unverdicted · novelty 6.0

UG-TTT adds epistemic uncertainty measured by adapter disagreement as an exploration bonus in RL for LLMs, raising maximum reward and diversity on scientific discovery benchmarks.

Decoupled PFNs: Identifiable Epistemic-Aleatoric Decomposition via Structured Synthetic Priors

stat.ML · 2026-05-07 · conditional · novelty 6.0

Decoupled PFNs use controllable synthetic priors to train separate latent-signal and noise heads, making epistemic-aleatoric decomposition identifiable and improving acquisition in noisy settings.

Active Learning for Communication Structure Optimization in LLM-Based Multi-Agent Systems

cs.MA · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

An ensemble-based information-theoretic active learning method using ensemble Kalman inversion selects valuable tasks to optimize communication structures in LLM multi-agent systems more reliably than random sampling under limited training budgets.

Vibrotactile Preference Learning: Uncertainty-Aware Preference Learning for Personalized Vibration Feedback

cs.HC · 2026-04-22 · unverdicted · novelty 6.0

VPL learns individualized vibrotactile preferences efficiently via uncertainty-aware Gaussian process models and active query selection in a 13-participant user study on an Xbox controller.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

Bayesian Active Learning for Classification and Preference Learning

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer