A Note on the Adaptive LASSO for Zero-Inflated Poisson Regression

被引:10
|
作者
Banerjee, Prithish [1 ]
Garai, Broti [2 ]
Mallick, Himel [3 ,4 ]
Chowdhury, Shrabanti [5 ]
Chatterjee, Saptarshi [6 ]
机构
[1] JP Morgan Chase & Co, New York, NY USA
[2] NBCUniversal, New York, NY USA
[3] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[4] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[5] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, New York, NY 10029 USA
[6] Eli Lilly & Co, Indianapolis, IN 46285 USA
关键词
D O I
10.1155/2018/2834183
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of modelling count data with excess zeros using Zero-Inflated Poisson (ZIP) regression. Recently, various regularization methods have been developed for variable selection in ZIP models. Among these, EM LASSO is a popular method for simultaneous variable selection and parameter estimation. However, EM LASSO suffers from estimation inefficiency and selection inconsistency. To remedy these problems, we propose a set of EM adaptive LASSO methods using a variety of data-adaptive weights. We show theoretically that the new methods are able to identify the true model consistently, and the resulting estimators can be as efficient as oracle. The methods are further evaluated through extensive synthetic experiments and applied to a German health care demand dataset.
引用
收藏
页数:9
相关论文
共 50 条