Model-based clustering for multivariate partial ranking data

被引:22
|
作者
Jacques, Julien [1 ,2 ,3 ]
Biernacki, Christophe [1 ,2 ,3 ]
机构
[1] Univ Lille 1, F-59655 Villeneuve Dascq, France
[2] CNRS, F-75700 Paris, France
[3] Inria, Paris, France
关键词
Multivariate ranking; Partial ranking; Mixture model; Insertion sort rank; SEM algorithm; Gibbs sampling; MIXTURE;
D O I
10.1016/j.jspi.2014.02.011
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This paper proposes the first model-based clustering algorithm dedicated to multivariate partial ranking data. This is an extension of the Insertion Sorting Rank (ISR) model for ranking data, which has the dual property to be a meaningful model through its location and scale parameters description and to be a kind of "physical" model through its derivation from the ranking generating process assumed to be an insertion sorting algorithm. The heterogeneity of the rank population is modeled by a mixture of BR, whereas a conditional independence assumption allows the extension to multivariate ranking. Maximum likelihood estimation is performed through a SEM-Gibbs algorithm, and partial rankings are considered as missing data, that allows us to simulate them during the estimation process. After having validated the estimation algorithm as well as the robustness of the model on simulated datasets, three real datasets were studied: the 1980 American Psychological Association (APA) presidential election votes, the results of French students to a general knowledge test and the votes of the European countries to the Eurovision song contest. The proposed model appears to be relevant in comparison with the most standard competitor ranking models (when available) and leads to significant interpretation for each application. In particular, regional alliances between European countries are exhibited in the Eurovision contest, which are often suspected but never proved. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:201 / 217
页数:17
相关论文
共 50 条
  • [31] Model-Based Clustering for Conditionally Correlated Categorical Data
    Matthieu Marbac
    Christophe Biernacki
    Vincent Vandewalle
    [J]. Journal of Classification, 2015, 32 : 145 - 175
  • [32] Model-based clustering and outlier detection with missing data
    Hung Tong
    Cristina Tortora
    [J]. Advances in Data Analysis and Classification, 2022, 16 : 5 - 30
  • [33] Model-based clustering and analysis of life history data
    Scott, Marc A.
    Mohan, Kaushik
    Gauthier, Jacques-Antoine
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2020, 183 (03) : 1231 - 1251
  • [34] Model-based clustering and outlier detection with missing data
    Tong, Hung
    Tortora, Cristina
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 5 - 30
  • [35] Bayesian model-based clustering for longitudinal ordinal data
    Roy Costilla
    Ivy Liu
    Richard Arnold
    Daniel Fernández
    [J]. Computational Statistics, 2019, 34 : 1015 - 1038
  • [36] On Model-Based Clustering of Directional Data with Heavy Tails
    Yingying Zhang
    Volodymyr Melnykov
    Igor Melnykov
    [J]. Journal of Classification, 2023, 40 (3) : 527 - 551
  • [37] BAYESIAN MODEL-BASED CLUSTERING FOR POPULATIONS OF NETWORK DATA
    Mantziou, Anastasia
    Lunagomez, Simon
    Mitra, Robin
    [J]. ANNALS OF APPLIED STATISTICS, 2024, 18 (01): : 266 - 302
  • [38] Model-Based Clustering of Inhomogeneous Paired Comparison Data
    Busse, Ludwig M.
    Buhmann, Joachim M.
    [J]. SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 207 - 221
  • [39] Cloud Model-based Data Attributes Reduction for Clustering
    Xu Ru-zhi
    Nie Pei-yao
    Lin Pei-guang
    Chu Dong-sheng
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 33 - 36
  • [40] Model-Based Clustering of Mixed Data With Sparse Dependence
    Choi, Young-Geun
    Ahn, Soohyun
    Kim, Jayoun
    [J]. IEEE ACCESS, 2023, 11 : 75945 - 75954