Towards Better Estimation of Statistical Significance When Comparing Evolutionary Algorithms

被引:0
|
作者
Buzdalov, Maxim [1 ]
机构
[1] ITMO Univ, St Petersburg, Russia
基金
俄罗斯科学基金会;
关键词
Multiple comparisons; statistical significance; TESTS;
D O I
10.1145/3319619.3326899
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The use of well-established statistical testing procedures to compare the performance of evolutionary algorithms often yields pessimistic results. This requires increasing the number of independent samples, and thus the computation time, in order to get results with the necessary precision. We aim at improving this situation by developing statistical tests that are good in answering typical questions coming from benchmarking of evolutionary algorithms. Our first step, presented in this paper, is a procedure that determines whether the performance distributions of two given algorithms are identical for each of the benchmarks. Our experimental study shows that this procedure is able to spot very small differences in the performance of algorithms while requiring computational budgets which are by an order of magnitude smaller (e.g. 15x) compared to the existing approaches.
引用
收藏
页码:1782 / 1788
页数:7
相关论文
共 50 条
  • [1] When Is an Estimation of Distribution Algorithm Better than an Evolutionary Algorithm?
    Chen, Tianshi
    Lehre, Per Kristian
    Tang, Ke
    Yao, Xin
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 1470 - +
  • [2] Towards Statistical Convergence Criteria for Mutation-Based Evolutionary Algorithms
    Campelo, Felipe
    2015 LATIN AMERICA CONGRESS ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2015,
  • [3] WHEN STATISTICAL SIGNIFICANCE IS NOT ENOUGH: INVESTIGATING RELEVANCE, PRACTICAL SIGNIFICANCE, AND STATISTICAL SIGNIFICANCE
    Mohajeri, Kaveh
    Mesgari, Mostafa
    Lee, Allen S.
    MIS QUARTERLY, 2020, 44 (02) : 525 - 559
  • [4] A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms
    Derrac, Joaquin
    Garcia, Salvador
    Molina, Daniel
    Herrera, Francisco
    SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) : 3 - 18
  • [5] Design of evolutionary algorithms -: A statistical perspective
    François, O
    Lavergne, C
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2001, 5 (02) : 129 - 148
  • [6] Statistical significance comparing cricothyroidotomy techniques
    McCracken, G. C.
    ANAESTHESIA, 2019, 74 (02) : 249 - 250
  • [7] Towards a Better Diversity of Evolutionary Multi-Criterion Optimization Algorithms using Local Searches
    Seada, Haitham
    Abouhawwash, Mohamed
    Deb, Kalyanmoy
    PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 77 - 78
  • [8] Comparing evolutionary algorithms on the problem of network inference
    Spieth, Christian
    Worzischek, Rene
    Streichert, Felix
    GECCO 2006: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOL 1 AND 2, 2006, : 305 - +
  • [9] Comparing Parameter Tuning Methods for Evolutionary Algorithms
    Smit, S. K.
    Eiben, A. E.
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 399 - 406
  • [10] Offspring Population Size Matters when Comparing Evolutionary Algorithms with Self-Adjusting Mutation Rates
    Rodionova, Anna
    Antonov, Kirill
    Buzdalova, Arina
    Doerr, Carola
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 855 - 863