Efficient set-valued prediction in multi-class classification

被引:0
|
作者
Thomas Mortier
Marek Wydmuch
Krzysztof Dembczyński
Eyke Hüllermeier
Willem Waegeman
机构
[1] Ghent University,Department of Data Analysis and Mathematical Modelling
[2] Poznań Unversity of Technology,Institute of Computing Science
[3] Yahoo! Research,Institute of Informatics
[4] LMU Munich,undefined
来源
关键词
Set-valued prediction; Multi-class classification; Expected utility maximization;
D O I
暂无
中图分类号
学科分类号
摘要
In cases of uncertainty, a multi-class classifier preferably returns a set of candidate classes instead of predicting a single class label with little guarantee. More precisely, the classifier should strive for an optimal balance between the correctness (the true class is among the candidates) and the precision (the candidates are not too many) of its prediction. We formalize this problem within a general decision-theoretic framework that unifies most of the existing work in this area. In this framework, uncertainty is quantified in terms of conditional class probabilities, and the quality of a predicted set is measured in terms of a utility function. We then address the problem of finding the Bayes-optimal prediction, i.e., the subset of class labels with the highest expected utility. For this problem, which is computationally challenging as there are exponentially (in the number of classes) many predictions to choose from, we propose efficient algorithms that can be applied to a broad family of utility functions. Our theoretical results are complemented by experimental studies, in which we analyze the proposed algorithms in terms of predictive accuracy and runtime efficiency.
引用
收藏
页码:1435 / 1469
页数:34
相关论文
共 50 条