Prediction and outlier detection in classification problems

· 2019 · stat.ME · arXiv 1905.04396

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

We consider the multi-class classification problem when the training data and the out-of-sample test data may have different distributions and propose a method called BCOPS (balanced and conformal optimized prediction sets). BCOPS constructs a prediction set $C(x)$ as a subset of class labels, possibly empty. It tries to optimize the out-of-sample performance, aiming to include the correct class as often as possible, but also detecting outliers $x$, for which the method returns no prediction (corresponding to $C(x)$ equal to the empty set). The proposed method combines supervised-learning algorithms with the method of conformal prediction to minimize a misclassification loss averaged over the out-of-sample distribution. The constructed prediction sets have a finite-sample coverage guarantee without distributional assumptions. We also propose a method to estimate the outlier detection rate of a given method. We prove asymptotic consistency and optimality of our proposals under suitable assumptions and illustrate our methods on real data examples.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection

stat.ML · 2019-07-08 · unverdicted · novelty 5.0

NN anomaly detection methods achieve competitive empirical performance on benchmarks and receive finite-sample misclassification guarantees derived from empirical DTM analysis under Huber's contamination model with geometric assumptions.

A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification

cs.LG · 2021-07-15 · unverdicted · novelty 5.0

Pith review generated a malformed one-line summary.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection stat.ML · 2019-07-08 · unverdicted · none · ref 3 · internal anchor
NN anomaly detection methods achieve competitive empirical performance on benchmarks and receive finite-sample misclassification guarantees derived from empirical DTM analysis under Huber's contamination model with geometric assumptions.

Prediction and outlier detection in classification problems

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer