Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets

Cited by: 33

Authors:

Singh, Pawan Kumar [1]

Sarkar, Ram [1]

Nasipuri, Mita [1]

Affiliation:

[1] Jadavpur Univ, Dept Comp Sci & Engn, 188 Raja SC Mullick Rd, Kolkata 700032, W Bengal, India

Source:

INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS | 2016 / Vol. 7 / No. 5

Keywords:

statistical comparison; non-parametric test; Scheffe's test; Wilcoxon signed-rank test; Friedman test; post-hoc test

DOI:

10.1504/IJCSM.2016.080073

Chinese Library Classification (CLC) number:

T [Industrial Technology]

Discipline classification code:

08

Abstract:

In machine learning, developing new algorithms or, more often, making minor modifications to existing ones is a common task. In such cases, a rigorous and correct statistical analysis of the results of the different algorithms is necessary to select the appropriate technique(s) for the problem to be solved. The main difficulty in meeting this need is the absence of a proper compilation of statistical techniques. In this paper, we propose the use of two important non-parametric statistical tests: the Wilcoxon signed-rank test for comparing two classifiers, and the Friedman test with its corresponding post-hoc tests for comparing multiple classifiers over multiple datasets. We also introduce a new variant of non-parametric test, known as Scheffe's test, for locating unequal pairs of means of the performances of multiple classifiers when the given datasets are of unequal sizes. The parametric tests previously used for comparing multiple classifiers are also described in brief. The proposed non-parametric tests are also applied to the classification results on ten real-problem datasets taken from the UCI Machine Learning Database Repository (http://www.ics.uci.edu/mlearn) (Valdovinos and Sanchez, 2009) as case studies.
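
As a rough illustration of the Friedman test named in the abstract, the sketch below ranks classifiers within each dataset and computes the standard Friedman chi-square statistic from the mean ranks. It is not taken from the paper: the function names, the accuracy table, and the choice to omit any tie correction are all assumptions made for illustration only.

```python
def average_ranks(scores):
    """Rank one dataset's scores: highest score gets rank 1, ties share the mean rank."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of classifiers tied with position i.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for pos in range(i, j + 1):
            ranks[order[pos]] = mean_rank
        i = j + 1
    return ranks


def friedman_statistic(table):
    """table[d][c] = score of classifier c on dataset d.

    Returns (mean rank per classifier, Friedman chi-square statistic).
    No tie correction is applied (a simplifying assumption).
    """
    n = len(table)        # number of datasets
    k = len(table[0])     # number of classifiers
    rank_rows = [average_ranks(row) for row in table]
    mean_ranks = [sum(row[c] for row in rank_rows) / n for c in range(k)]
    chi2 = 12 * n / (k * (k + 1)) * (
        sum(r * r for r in mean_ranks) - k * (k + 1) ** 2 / 4
    )
    return mean_ranks, chi2


# Hypothetical accuracies (made up, NOT results from the paper):
# rows are datasets, columns are three classifiers.
accuracies = [
    [0.91, 0.88, 0.85],
    [0.79, 0.81, 0.75],
    [0.94, 0.90, 0.90],
    [0.67, 0.66, 0.60],
]
mean_ranks, chi2 = friedman_statistic(accuracies)
print(mean_ranks, chi2)
```

The statistic would then be compared against a chi-square distribution with k - 1 degrees of freedom; if the null hypothesis of equal performance is rejected, a post-hoc test is used to locate which classifiers differ.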


Pages: 410-442

Page count: 33
