Two-Component Mixture Model in the Presence of Covariates

被引:4
|
作者
Deb, Nabarun [1 ]
Saha, Sujayam [2 ]
Guntuboyina, Adityanand [3 ]
Sen, Bodhisattva [1 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] Google Inc, Mountain View, CA USA
[3] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
Expectation-maximization algorithm; Gaussian location mixture; Identifiability; Local false discovery rate; Nonparametric maximum likelihood; Two-groups model;
D O I
10.1080/01621459.2021.1888739
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we study a generalization of the two-groups model in the presence of covariates-a problem that has recently received much attention in the statistical literature due to its applicability in multiple hypotheses testing problems. The model we consider allows for infinite dimensional parameters and offers flexibility in modeling the dependence of the response on the covariates. We discuss the identifiability issues arising in this model and systematically study several estimation strategies. We propose a tuning parameter-free nonparametric maximum likelihood method, implementable via the expectation-maximization algorithm, to estimate the unknown parameters. Further, we derive the rate of convergence of the proposed estimators-in particular we show that the finite sample Hellinger risk for every 'approximate' nonparametric maximum likelihood estimator achieves a near-parametric rate (up to logarithmic multiplicative factors). In addition, we propose and theoretically study two 'marginal' methods that are more scalable and easily implementable. We demonstrate the efficacy of our procedures through extensive simulation studies and relevant data analyses-one arising from neuroscience and the other from astronomy. We also outline the application of our methods to multiple testing. The companion R package NPMLEmix implements all the procedures proposed in this article.
引用
收藏
页码:1820 / 1834
页数:15
相关论文
共 50 条
  • [1] Semiparametric estimation of a two-component mixture model
    Bordes, Laurent
    Mottelet, Stephane
    Vandekerkhove, Pierre
    ANNALS OF STATISTICS, 2006, 34 (03): : 1204 - 1232
  • [2] Grease flow based on a two-component mixture model
    Tichy, John
    Menut, Marine
    Oumahi, Camella
    Muller, Sandrine
    Bou-Said, Benyebka
    TRIBOLOGY INTERNATIONAL, 2021, 153
  • [3] Grease flow based on a two-component mixture model
    Tichy, John
    Menut, Marine
    Oumahi, Camella
    Muller, Sandrine
    Bou-Saïd, Benyebka
    Tribology International, 2021, 153
  • [4] A two-component nonparametric mixture model with stochastic dominance
    Wu, Jingjing
    Abedin, Tasnima
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2021, 50 (04) : 1029 - 1057
  • [5] A two-component nonparametric mixture model with stochastic dominance
    Jingjing Wu
    Tasnima Abedin
    Journal of the Korean Statistical Society, 2021, 50 : 1029 - 1057
  • [6] A two-component mixture model for density estimation and classification
    Zhu, Degang
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2016, 19 (02) : 311 - 319
  • [7] On finite mixture of two-component Gompertz lifetime model
    Al-Hussaini, EK
    Al-Dayian, GR
    Adham, SA
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2000, 67 (01) : 1 - 20
  • [8] A CONSISTENT KINETIC MODEL FOR A TWO-COMPONENT MIXTURE OF POLYATOMIC MOLECULES
    Klingenberg, Christian
    Pirner, Marlies
    Puppo, Gabriella
    COMMUNICATIONS IN MATHEMATICAL SCIENCES, 2019, 17 (01) : 149 - 173
  • [9] A CONSISTENT KINETIC MODEL FOR A TWO-COMPONENT MIXTURE WITH AN APPLICATION TO PLASMA
    Klingenberg, Christian
    Pirner, Marlies
    Puppo, Gabriella
    KINETIC AND RELATED MODELS, 2017, 10 (02) : 445 - 465
  • [10] Theoretical analysis of power in a two-component normal mixture model
    Hall, P
    Stewart, M
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2005, 134 (01) : 158 - 179