A hypothesis test for comparing two partitions obtained from the same dataset

被引:0
|
作者
Bourel, Mathias [1 ]
Ghattas, Badih [2 ]
Gonzalez, Meliza [1 ]
机构
[1] Univ Republica, Inst Matemat & Estadist, Montevideo, Uruguay
[2] Aix Marseille Sch Econ, Marseille, France
关键词
Clustering; Comparing partitions; Hypothesis test; Matching error; CLUSTERINGS; CRITERIA;
D O I
10.1080/03610918.2025.2458574
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a non parametric hypothesis test to compare two partitions of a same data set. The partitions may result from two different clustering approaches. The test may be done using any comparison index but we focus in particular on the Matching Error (ME) that is related to the misclassification error in supervised learning. Some properties of the ME and, especially, its distribution function for the case of two different partitions are analyzed. Extensive simulations and experiments show the efficiency of the test.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Same data, different conclusions: Radical dispersion in empirical results when independent analysts operationalize and test the same hypothesis
    Schweinsberg, Martin
    Feldman, Michael
    Staub, Nicola
    van den Akker, Olmo R.
    van Aert, Robbie C. M.
    van Assen, Marcel A. L. M.
    Liu, Yang
    Althoff, Tim
    Heer, Jeffrey
    Kale, Alex
    Mohamed, Zainab
    Amireh, Hashem
    Prasad, Vaishali Venkatesh
    Bernstein, Abraham
    Robinson, Emily
    Snellman, Kaisa
    Sommer, S. Amy
    Otner, Sarah M. G.
    Robinson, David
    Madan, Nikhil
    Silberzahn, Raphael
    Goldstein, Pavel
    Tierney, Warren
    Murase, Toshio
    Mandl, Benjamin
    Viganola, Domenico
    Strobl, Carolin
    Schaumans, Catherine B. C.
    Kelchtermans, Stijn
    Naseeb, Chan
    Garrison, S. Mason
    Yarkoni, Tal
    Chan, C. S. Richard
    Adie, Prestone
    Alaburda, Paulius
    Albers, Casper
    Alspaugh, Sara
    Alstott, Jeff
    Nelson, Andrew A.
    de la Rubia, Eduardo Arinno
    Arzi, Adbi
    Bahnik, Stepan
    Baik, Jason
    Balling, Laura Winther
    Banker, Sachin
    Baranger, David A. A.
    Barr, Dale J.
    Barros-Rivera, Brenda
    Bauer, Matt
    Blaise, Enuh
    ORGANIZATIONAL BEHAVIOR AND HUMAN DECISION PROCESSES, 2021, 165 : 228 - 249
  • [42] COMPARING ORANGES AND ORANGES: UNLIKELY TO FIND A DIFFERENCE IN TWO OF THE SAME THING
    Gupta, Vaibhav
    Kidane, Biniam
    JOURNAL OF THORACIC AND CARDIOVASCULAR SURGERY, 2018, 156 (05): : 2018 - 2018
  • [43] Comparing the functional range of English to be to German sein: a test of the boundary permeability hypothesis
    Berg, Thomas
    CORPUS LINGUISTICS AND LINGUISTIC THEORY, 2023, 19 (03) : 371 - 396
  • [44] Comparing relative importance of the same component in a series system in two environments
    Deshpande, J. V.
    Naik-Nimbalkar, U. V.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2007, 137 (11) : 3410 - 3415
  • [45] Comparing Sleep Segmentation Between Traditional and Western Populations: a Test of the Sentinel Hypothesis
    Shattuck, Eric C.
    Samson, David R.
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2019, 168 : 226 - 226
  • [47] Same system, different outcomes: Comparing the transitions from two paper-based systems to the same computerized physician order entry system
    Niazkhani, Zahra
    van der Sijs, Heleen
    Pirnejad, Habibollah
    Redekop, William K.
    Aarts, Jos
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2009, 78 (03) : 170 - 181
  • [48] Comparing Papanicolaou test results obtained during pregnancy and post-partum
    Suzuki, Kazuhiro
    Furuhashi, Madoka
    Kawamura, Takuya
    Kubo, Michiko
    Osato, Kazuhiro
    Yamawaki, Takaharu
    JOURNAL OF OBSTETRICS AND GYNAECOLOGY RESEARCH, 2017, 43 (04) : 705 - 709
  • [49] Comparing the estimates obtained from ordinary and robust kriging
    Costa, JF
    White, AH
    Vilhena, MTMB
    26TH PROCEEDINGS OF THE APPLICATIONS OF COMPUTERS AND OPERATIONS RESEARCH IN THE MINERAL INDUSTRY, 1996, : 47 - 52
  • [50] A more powerful test for comparing two Poisson means
    Krishnamoorthy, K
    Thomson, J
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2004, 119 (01) : 23 - 35