Efficient and accurate inference for mixtures of Mallows models with Spearman distance

被引:0
|
作者
Marta Crispino
Cristina Mollica
Valerio Astuti
Luca Tardella
机构
[1] Bank of Italy,DG of Economics, Statistics and Research
[2] Sapienza University of Rome,Department of Statistical Sciences
来源
Statistics and Computing | 2023年 / 33卷
关键词
Ranking data; Distance-based models; Model-based clustering; EM algorithm; Censoring;
D O I
暂无
中图分类号
学科分类号
摘要
The Mallows model (MM) occupies a central role in parametric modelling of ranking data to learn preferences of a population of judges. Despite the wide range of metrics for rankings that can be considered in the model specification, the choice is typically limited to the Kendall, Cayley or Hamming distances, due to the closed-form expression of the related model normalizing constant. This work instead focuses on the Mallows model with Spearman distance (MMS). A novel approximation of the normalizing constant is introduced to allow inference even with a large number of items. This allows us to develop and implement an efficient and accurate EM algorithm for estimating finite mixtures of MMS aimed at i) enlarging the applicability to samples drawn from heterogeneous populations, and ii) dealing with partial rankings affected by diverse forms of censoring. These novelties encompass the critical inferential steps that traditionally limited the use of this distance in practice, and render the MMS comparable (or even preferable) to the MMs with other metrics in terms of computational burden. The inferential ability of the EM scheme and the effectiveness of the approximation are assessed by extensive simulation studies. Finally, we show that the application to three real-world datasets endorses our proposals also in the comparison with competing mixtures of ranking models.
引用
收藏
相关论文
共 50 条
  • [41] DeepSpeed-Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale
    Aminabadi, Reza Yazdani
    Rajbhandari, Samyam
    Awan, Ammar Ahmad
    Li, Cheng
    Li, Du
    Zheng, Elton
    Ruwase, Olatunji
    Smith, Shaden
    Zhang, Minjia
    Rasley, Jeff
    He, Yuxiong
    [J]. SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022,
  • [42] Accurate inference of crowdsourcing properties when using efficient allocation strategies
    Abigail Hotaling
    James Bagrow
    [J]. Scientific Reports, 12
  • [43] Accurate inference of crowdsourcing properties when using efficient allocation strategies
    Hotaling, Abigail
    Bagrow, James
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01)
  • [44] FastME 2.0: A Comprehensive, Accurate, and Fast Distance-Based Phylogeny Inference Program
    Lefort, Vincent
    Desper, Richard
    Gascuel, Olivier
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2015, 32 (10) : 2798 - 2800
  • [45] Efficient Hellinger distance estimates for semiparametric models
    Wu, Jingjing
    Karunamuni, Rohana J.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2012, 107 : 1 - 23
  • [46] Online but Accurate Inference for Latent Variable Models with Local Gibbs Sampling
    Dupuy, Christophe
    Bach, Francis
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [48] Efficient simulation-based minimum distance estimation and indirect inference
    Nickl R.
    Pötscher B.M.
    [J]. Mathematical Methods of Statistics, 2010, 19 (4) : 327 - 364
  • [49] Efficient Inference in Stochastic Block Models With Vertex Labels
    Stegehuis, Clara
    Massoulie, Laurent
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (03): : 1215 - 1225
  • [50] Computationally efficient inference for latent position network models
    Rastelli, Riccardo
    Maire, Florian
    Friel, Nial
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (01): : 2531 - 2570