Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets

被引:33
|
作者
Singh, Pawan Kumar [1 ]
Sarkar, Ram [1 ]
Nasipuri, Mita [1 ]
机构
[1] Jadavpur Univ, Dept Comp Sci & Engn, 188 Raja SC Mullick Rd, Kolkata 700032, W Bengal, India
关键词
statistical comparison; non-parametric test; Scheffe's test; Wilcoxon-signed rank test; Friedman test; post-hoc test;
D O I
10.1504/IJCSM.2016.080073
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In machine learning, generation of new algorithms or, in most cases, minor amendment of the existing ones is a common task. In such cases, a rigorous and correct statistical analysis of the results of different algorithms is necessary in order to select the exact technique(s) depending on the problem to be solved. The main inconvenience related to this necessity is the absence of proper compilation of statistical techniques. In this paper, we propose the use of two important non-parametric statistical tests, namely, Wilcoxon signed rank test for comparison of two classifiers and Friedman test with the corresponding post-hoc tests for comparison of multiple classifiers over multiple datasets. We also introduce a new variant of non-parametric test known as Scheffe's test for locating unequal pairs of means of performances of multiple classifiers when the given datasets are of unequal sizes. The parametric tests, which were previously being used for comparing multiple classifiers, have also been described in brief. The proposed non-parametric tests have also been applied on the classification results on ten real-problem datasets taken from the UCI Machine Learning Database Repository (http://www.ics.uci.edu/mlearn) (Valdovinos and Sanchez, 2009) as case studies.
引用
收藏
页码:410 / 442
页数:33
相关论文
共 50 条
  • [41] Comparison of two atmospheric sampling methodologies with non-parametric statistical tools
    Nunes, MJ
    Camoes, MF
    McGovern, F
    Santos, S
    Raes, F
    QUIMICA NOVA, 2005, 28 (02): : 179 - 182
  • [42] Non-parametric statistical methods for multivariate calibration model selection and comparison
    Thomas, EV
    JOURNAL OF CHEMOMETRICS, 2003, 17 (12) : 653 - 659
  • [43] Non-parametric tests for the comparison of point processes based on incomplete data
    Sun, JG
    Rai, SN
    SCANDINAVIAN JOURNAL OF STATISTICS, 2001, 28 (04) : 725 - 732
  • [44] Non-parametric bootstrap tests for parametric distribution families
    Szucs, Gabor
    ACTA SCIENTIARUM MATHEMATICARUM, 2011, 77 (3-4): : 703 - 723
  • [45] Non-parametric bootstrap tests for parametric distribution families
    Gábor Szűcs
    Acta Scientiarum Mathematicarum, 2011, 77 (3-4): : 703 - 723
  • [46] Statistical Study of Machine Learning Algorithms Using Parametric and Non-Parametric Tests: A Comparative Analysis and Recommendations
    Khadse, Vijay M.
    Mahalle, Parikshit Narendra
    Shinde, Gitanjali R.
    INTERNATIONAL JOURNAL OF AMBIENT COMPUTING AND INTELLIGENCE, 2020, 11 (03) : 80 - 105
  • [47] Reliability Analysis of Non-Parametric Statistical Tests for the Evaluation of Linear Drift in Experimental Data
    Cappa, P.
    Sciuto, S. A.
    Silvestri, S.
    STRAIN, 2001, 37 (02) : 67 - 72
  • [48] Parametric and non-parametric Statistical Process Control in a dairy industry
    Marra da Silva Ribeiro, Luiz Henrique
    de Araujo, Tatiane Gomes
    Ferreira, Eric Batista
    Zamboni, Jovelino Elias
    JOURNAL OF CANDIDO TOSTES DAIRY INSTITUTE, 2018, 73 (03): : 112 - 121
  • [49] A non-parametric significance test to compare corpora
    Koplenig, Alexander
    PLOS ONE, 2019, 14 (09):
  • [50] NON-PARAMETRIC STATISTICAL INFERENCE FOR THE SURVIVAL EXPERIMENTS
    Ramadurai, M.
    Basha, M. A. Ghouse
    JP JOURNAL OF BIOSTATISTICS, 2021, 18 (03) : 379 - 394