Dealing with overdispersion in multivariate count data

被引:4
|
作者
Corsini, Noemi [1 ]
Viroli, Cinzia [1 ]
机构
[1] Univ Bologna, Dept Stat Sci, via Belle Arti 41, I-40126 Bologna, Italy
关键词
Extra-variation; Mixture models; Deep learning; Maximum likelihood; FINITE MIXTURE DISTRIBUTION; ZERO-INFLATED POISSON; REGRESSION; MODEL;
D O I
10.1016/j.csda.2022.107447
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The problem of overdispersion in multivariate count data is a challenging issue. It covers a central role mainly due to the relevance of modern technology-based data, such as Next Generation Sequencing and textual data from the web or digital collections. A comprehensive analysis of the likelihood-based models for extra-variation data is presented. Particular attention is paid to the models feasible for high-dimensional data. A new approach together with its parametric-estimation procedure is proposed. It can be viewed as a deeper version of the Dirichlet-Multinomial distribution and it leads to important results allowing to get a better approximation of the observed variability. A significative comparison of the proposed model and existing strategies is made through two different simulation studies and an empirical data set, that confirm a better capability to describe overdispersion. (C) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Analysing longitudinal count data with overdispersion
    Jowaheer, V
    Sutradhar, BC
    [J]. BIOMETRIKA, 2002, 89 (02) : 389 - 399
  • [2] Approaches for dealing with various sources of overdispersion in modeling count data: Scale adjustment versus modeling
    Payne, Elizabeth H.
    Hardin, James W.
    Egede, Leonard E.
    Ramakrishnan, Viswanathan
    Selassie, Anbesaw
    Gebregziabher, Mulugeta
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (04) : 1802 - 1823
  • [3] OVERDISPERSION TESTS IN COUNT-DATA ANALYSIS
    Vives, Jaume
    Losilla, Josep-Maria
    Rodrigo, Maria-Florencia
    Portell, Mariona
    [J]. PSYCHOLOGICAL REPORTS, 2008, 103 (01) : 145 - 160
  • [4] Modelling count data with overdispersion and spatial effects
    Susanne Gschlößl
    Claudia Czado
    [J]. Statistical Papers, 2008, 49
  • [5] Modelling overdispersion and Markovian features in count data
    Iñaki F. Trocóniz
    Elodie L. Plan
    Raymond Miller
    Mats O. Karlsson
    [J]. Journal of Pharmacokinetics and Pharmacodynamics, 2009, 36 : 461 - 477
  • [6] Modelling count data with overdispersion and spatial effects
    Gschloessl, Susanne
    Czado, Claudia
    [J]. STATISTICAL PAPERS, 2008, 49 (03) : 531 - 552
  • [7] Modelling overdispersion and Markovian features in count data
    Troconiz, Inaki F.
    Plan, Elodie L.
    Miller, Raymond
    Karlsson, Mats O.
    [J]. JOURNAL OF PHARMACOKINETICS AND PHARMACODYNAMICS, 2009, 36 (05) : 461 - 477
  • [8] Goodness-of-Fit for Longitudinal Count Data with Overdispersion
    Xu, Wangli
    Lu, Yiqiang
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2009, 38 (20) : 3745 - 3754
  • [9] Detection of outliers in longitudinal count data via overdispersion
    Gumedze, Freedom N.
    Chatora, Tinashe D.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 79 : 192 - 202
  • [10] A Bayesian Approach to Account for Misclassification and Overdispersion in Count Data
    Wu, Wenqi
    Stamey, James
    Kahle, David
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2015, 12 (09): : 10648 - 10661