PENALIZED ESTIMATION IN HIGH-DIMENSIONAL HIDDEN MARKOV MODELS WITH STATE-SPECIFIC GRAPHICAL MODELS

被引:16
|
作者
Stadler, Nicolas [1 ]
Mukherjee, Sach [1 ]
机构
[1] Netherlands Canc Inst, Dept Biochem, NL-1066 CX Amsterdam, Netherlands
来源
ANNALS OF APPLIED STATISTICS | 2013年 / 7卷 / 04期
关键词
HMM; Graphical Lasso; universal regularization; model selection; MMDL; greedy backward pruning; genome biology; chromatin modeling; VARIABLE SELECTION; MIXTURE;
D O I
10.1214/13-AOAS662
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider penalized estimation in hidden Markov models (HMMs) with multivariate Normal observations. In the moderate-to-large dimensional setting, estimation for HMMs remains challenging in practice, due to several concerns arising from the hidden nature of the states. We address these concerns by l(1)-penalization of state-specific inverse covariance matrices. Penalized estimation leads to sparse inverse covariance matrices which can be interpreted as state-specific conditional independence graphs. Penalization is nontrivial in this latent variable setting; we propose a penalty that automatically adapts to the number of states K and the state-specific sample sizes and can cope with scaling issues arising from the unknown states. The methodology is adaptive and very general, applying in particular to both low- and high-dimensional settings without requiring hand tuning. Furthermore, our approach facilitates exploration of the number of states K by coupling estimation for successive candidate values K. Empirical results on simulated examples demonstrate the effectiveness of the proposed approach. In a challenging real data example from genome biology, we demonstrate the ability of our approach to yield gains in predictive power and to deliver richer estimates than existing methods.
引用
收藏
页码:2157 / 2179
页数:23
相关论文
共 50 条
  • [21] Graphical Models for Discrete Hidden Markov Models in Speech Recognition
    Miguel, Antonio
    Ortega, Alfonso
    Buera, Luis
    Lleida, Eduardo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1387 - 1390
  • [22] Penalized empirical likelihood for high-dimensional generalized linear models
    Chen, Xia
    Mao, Liyue
    STATISTICS AND ITS INTERFACE, 2021, 14 (02) : 83 - 94
  • [23] High-Dimensional Gaussian Graphical Regression Models with Covariates
    Zhang, Jingfei
    Li, Yi
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 2088 - 2100
  • [24] HIGH-DIMENSIONAL SEMIPARAMETRIC GAUSSIAN COPULA GRAPHICAL MODELS
    Liu, Han
    Han, Fang
    Yuan, Ming
    Lafferty, John
    Wasserman, Larry
    ANNALS OF STATISTICS, 2012, 40 (04): : 2293 - 2326
  • [25] Inference for High-dimensional Exponential Family Graphical Models
    Wang, Jialei
    Kolar, Mladen
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 1042 - 1050
  • [26] Experiments in stochastic computation for high-dimensional graphical models
    Jones, B
    Carvalho, C
    Dobra, A
    Hans, C
    Carter, C
    West, M
    STATISTICAL SCIENCE, 2005, 20 (04) : 388 - 400
  • [27] Uniform inference in high-dimensional Gaussian graphical models
    Klaassen, S.
    Kueck, J.
    Spindler, M.
    Chernozhukov, V
    BIOMETRIKA, 2023, 110 (01) : 51 - 68
  • [28] Ensemble of penalized logistic models for classification of high-dimensional data
    Ijaz, Musarrat
    Asghar, Zahid
    Gul, Asma
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (07) : 2072 - 2088
  • [29] Penalized least-squares estimation for regression coefficients in high-dimensional partially linear models
    Ni, Huey-Fan
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (02) : 379 - 389
  • [30] Efficient Distributed Estimation of High-dimensional Sparse Precision Matrix for Transelliptical Graphical Models
    Guan Peng WANG
    Heng Jian CUI
    Acta Mathematica Sinica,English Series, 2021, 37 (05) : 689 - 706