Model-based clustering for multivariate partial ranking data

Cited by: 22

Authors:

Jacques, Julien [1,2,3]

Biernacki, Christophe [1,2,3]

Affiliations:

[1] Univ Lille 1, F-59655 Villeneuve d'Ascq, France

[2] CNRS, F-75700 Paris, France

[3] Inria, Paris, France

Source:

JOURNAL OF STATISTICAL PLANNING AND INFERENCE | 2014 / Vol. 149

Keywords:

Multivariate ranking; Partial ranking; Mixture model; Insertion sort rank; SEM algorithm; Gibbs sampling; MIXTURE;

DOI:

10.1016/j.jspi.2014.02.011

Chinese Library Classification number:

O21 [Probability theory and mathematical statistics]; C8 [Statistics];

Discipline classification code:

020208 ; 070103 ; 0714 ;

Abstract:

This paper proposes the first model-based clustering algorithm dedicated to multivariate partial ranking data. This is an extension of the Insertion Sorting Rank (ISR) model for ranking data, which has the dual property to be a meaningful model through its location and scale parameters description and to be a kind of "physical" model through its derivation from the ranking generating process assumed to be an insertion sorting algorithm. The heterogeneity of the rank population is modeled by a mixture of ISR, whereas a conditional independence assumption allows the extension to multivariate ranking. Maximum likelihood estimation is performed through a SEM-Gibbs algorithm, and partial rankings are considered as missing data, that allows us to simulate them during the estimation process. After having validated the estimation algorithm as well as the robustness of the model on simulated datasets, three real datasets were studied: the 1980 American Psychological Association (APA) presidential election votes, the results of French students to a general knowledge test and the votes of the European countries to the Eurovision song contest. The proposed model appears to be relevant in comparison with the most standard competitor ranking models (when available) and leads to significant interpretation for each application. In particular, regional alliances between European countries are exhibited in the Eurovision contest, which are often suspected but never proved. © 2014 Elsevier B.V. All rights reserved.


Pages: 201-217

Number of pages: 17

50 items in total

[31] Model-Based Clustering for Conditionally Correlated Categorical Data
Matthieu Marbac
Christophe Biernacki
Vincent Vandewalle
[J]. Journal of Classification, 2015, 32 : 145 - 175
[32] Model-based clustering and outlier detection with missing data
Hung Tong
Cristina Tortora
[J]. Advances in Data Analysis and Classification, 2022, 16 : 5 - 30
[33] Model-based clustering and analysis of life history data
Scott, Marc A.
Mohan, Kaushik
Gauthier, Jacques-Antoine
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2020, 183 (03) : 1231 - 1251
[34] Model-based clustering and outlier detection with missing data
Tong, Hung
Tortora, Cristina
[J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (01) : 5 - 30
[35] Bayesian model-based clustering for longitudinal ordinal data
Roy Costilla
Ivy Liu
Richard Arnold
Daniel Fernández
[J]. Computational Statistics, 2019, 34 : 1015 - 1038
[36] On Model-Based Clustering of Directional Data with Heavy Tails
Yingying Zhang
Volodymyr Melnykov
Igor Melnykov
[J]. Journal of Classification, 2023, 40 (3) : 527 - 551
[37] BAYESIAN MODEL-BASED CLUSTERING FOR POPULATIONS OF NETWORK DATA
Mantziou, Anastasia
Lunagomez, Simon
Mitra, Robin
[J]. ANNALS OF APPLIED STATISTICS, 2024, 18 (01) : 266 - 302
[38] Model-Based Clustering of Inhomogeneous Paired Comparison Data
Busse, Ludwig M.
Buhmann, Joachim M.
[J]. SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 207 - 221
[39] Cloud Model-based Data Attributes Reduction for Clustering
Xu Ru-zhi
Nie Pei-yao
Lin Pei-guang
Chu Dong-sheng
[J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008 : 33 - 36
[40] Model-Based Clustering of Mixed Data With Sparse Dependence
Choi, Young-Geun
Ahn, Soohyun
Kim, Jayoun
[J]. IEEE ACCESS, 2023, 11 : 75945 - 75954
