Although being a crucial question for the development of machine learning algorithms, there is still no consensus on how to compare classifiers over multiple data sets with respect to several criteria. Every comparison framework is confronted with (at least) three fundamental challenges: the multiplicity of quality criteria, the multiplicity of data sets and the randomness of the selection of data sets. In this paper, we add a fresh view to the vivid debate by adopting recent developments in decision theory. Based on so-called preference systems, our framework ranks classifiers by a generalized concept of stochastic dominance, which powerfully circumvents the cumbersome, and often even self-contradictory, reliance on aggregates. Moreover, we show that generalized stochastic dominance can be operationalized by solving easy-to-handle linear programs and moreover statistically tested employing an multiple quality criteria simultaneously. We illustrate and investigate our framework in a simulation study and with a set of standard benchmark data sets.
机构:
McMaster Univ, Dept Math & Stat, Hamilton, ON, Canada
King Abdulaziz Univ, Dept Stat, Jeddah 21413, Saudi ArabiaMcMaster Univ, Dept Math & Stat, Hamilton, ON, Canada
Balakrishnan, Narayanaswamy
Haidari, Abedin
论文数: 0引用数: 0
h-index: 0
机构:
Shahid Beheshti Univ, Fac Math Sci, Tehran, IranMcMaster Univ, Dept Math & Stat, Hamilton, ON, Canada
Haidari, Abedin
Masoumifard, Khaled
论文数: 0引用数: 0
h-index: 0
机构:
Univ Tehran, Sch Math Stat & Comp Sci, Tehran, IranMcMaster Univ, Dept Math & Stat, Hamilton, ON, Canada
机构:
North China Elect Power Univ, Dept Math & Phys, Baoding 071003, Peoples R ChinaNorth China Elect Power Univ, Dept Math & Phys, Baoding 071003, Peoples R China
Wang, Wen-Xin
Ma, Yan-Peng
论文数: 0引用数: 0
h-index: 0
机构:
North China Elect Power Univ, Dept Math & Phys, Baoding 071003, Peoples R ChinaNorth China Elect Power Univ, Dept Math & Phys, Baoding 071003, Peoples R China