Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets

Cited by: 33

Authors:

Singh, Pawan Kumar [1]

Sarkar, Ram [1]

Nasipuri, Mita [1]

Affiliations:

[1] Jadavpur Univ, Dept Comp Sci & Engn, 188 Raja SC Mullick Rd, Kolkata 700032, W Bengal, India

Source:

INTERNATIONAL JOURNAL OF COMPUTING SCIENCE AND MATHEMATICS | 2016 / Vol. 7 / No. 05

Keywords:

statistical comparison; non-parametric test; Scheffe's test; Wilcoxon-signed rank test; Friedman test; post-hoc test

DOI:

10.1504/IJCSM.2016.080073

Chinese Library Classification (CLC) number:

T [Industrial Technology]

Discipline classification code:

08

Abstract:

In machine learning, generation of new algorithms or, in most cases, minor amendment of the existing ones is a common task. In such cases, a rigorous and correct statistical analysis of the results of different algorithms is necessary in order to select the exact technique(s) depending on the problem to be solved. The main inconvenience related to this necessity is the absence of proper compilation of statistical techniques. In this paper, we propose the use of two important non-parametric statistical tests, namely, Wilcoxon signed rank test for comparison of two classifiers and Friedman test with the corresponding post-hoc tests for comparison of multiple classifiers over multiple datasets. We also introduce a new variant of non-parametric test known as Scheffe's test for locating unequal pairs of means of performances of multiple classifiers when the given datasets are of unequal sizes. The parametric tests, which were previously being used for comparing multiple classifiers, have also been described in brief. The proposed non-parametric tests have also been applied on the classification results on ten real-problem datasets taken from the UCI Machine Learning Database Repository (http://www.ics.uci.edu/mlearn) (Valdovinos and Sanchez, 2009) as case studies.
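As an illustration of the two tests named in the abstract, the following is a minimal sketch and not the authors' implementation. It computes the Wilcoxon signed-rank statistic W = min(R+, R-) for two classifiers and the Friedman chi-square statistic for several classifiers, using only the Python standard library. The function names and the accuracy table are hypothetical. The sketch assumes zero differences are dropped in the Wilcoxon test and applies no tie correction to the Friedman statistic; p-values and post-hoc tests are omitted.

```python
from typing import List, Sequence, Tuple


def average_ranks(values: Sequence[float], descending: bool = False) -> List[float]:
    """Rank values from 1..n; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=descending)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def wilcoxon_signed_rank(a: Sequence[float], b: Sequence[float]) -> float:
    """Return W = min(R+, R-) for paired scores (assumption: zero differences dropped)."""
    diffs = [x - y for x, y in zip(a, b) if x != y]
    ranks = average_ranks([abs(d) for d in diffs])
    r_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    r_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    return min(r_plus, r_minus)


def friedman_statistic(scores: Sequence[Sequence[float]]) -> Tuple[float, List[float]]:
    """scores[i][j] is the score of classifier j on dataset i (higher is better).

    Returns the Friedman chi-square statistic (no tie correction) and the
    mean rank of each classifier, where rank 1 is the best on a dataset.
    """
    n, k = len(scores), len(scores[0])
    rank_sums = [0.0] * k
    for row in scores:
        for j, r in enumerate(average_ranks(row, descending=True)):
            rank_sums[j] += r
    mean_ranks = [s / n for s in rank_sums]
    chi2 = 12 * n / (k * (k + 1)) * (
        sum(r * r for r in mean_ranks) - k * (k + 1) ** 2 / 4
    )
    return chi2, mean_ranks


# Hypothetical accuracies (percent): 5 datasets (rows) x 3 classifiers (columns).
example_scores = [
    [90, 85, 80],
    [88, 84, 86],
    [75, 70, 65],
    [92, 91, 89],
    [81, 78, 79],
]

if __name__ == "__main__":
    chi2, mean_ranks = friedman_statistic(example_scores)
    print("Friedman chi-square:", chi2, "mean ranks:", mean_ranks)
    col_b = [row[1] for row in example_scores]
    col_c = [row[2] for row in example_scores]
    print("Wilcoxon W (classifier 2 vs 3):", wilcoxon_signed_rank(col_b, col_c))
```

In practice the statistics would be compared against tabulated critical values or converted to p-values, and a significant Friedman result would be followed by a post-hoc test of the kind the abstract mentions.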


Pages: 410-442

Page count: 33

