Statistical Comparisons of Classifiers by Generalized Stochastic Dominance

被引:0
|
作者
Jansen, Christoph [1 ]
Nalenz, Malte [1 ]
Schollmeyer, Georg [1 ]
Augustin, Thomas [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Stat, Ludwigstr 33, D-80539 Munich, Germany
关键词
algorithm comparison; statistical test; generalized stochastic dominance; preference system; decision theory; MULTIPLE ALGORITHMS; SELECTION; REGULARIZATION; DESIGN; TESTS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Although being a crucial question for the development of machine learning algorithms, there is still no consensus on how to compare classifiers over multiple data sets with respect to several criteria. Every comparison framework is confronted with (at least) three fundamental challenges: the multiplicity of quality criteria, the multiplicity of data sets and the randomness of the selection of data sets. In this paper, we add a fresh view to the vivid debate by adopting recent developments in decision theory. Based on so-called preference systems, our framework ranks classifiers by a generalized concept of stochastic dominance, which powerfully circumvents the cumbersome, and often even self-contradictory, reliance on aggregates. Moreover, we show that generalized stochastic dominance can be operationalized by solving easy-to-handle linear programs and moreover statistically tested employing an multiple quality criteria simultaneously. We illustrate and investigate our framework in a simulation study and with a set of standard benchmark data sets.
引用
收藏
页数:37
相关论文
共 50 条