Variance-Reduced Stochastic Optimization for Efficient Inference of Hidden Markov Models

被引:0
|
作者
Sidrow, Evan [1 ]
Heckman, Nancy [1 ]
Bouchard-Cote, Alexandre [1 ]
Fortune, Sarah M. E. [2 ]
Trites, Andrew W. [3 ]
Auger-Methe, Marie [4 ]
机构
[1] Univ British Columbia, Dept Stat, Vancouver, BC, Canada
[2] Dalhousie Univ, Dept Oceanog, Halifax, NS, Canada
[3] Univ British Columbia, Inst Oceans & Fisheries, Dept Zool, Vancouver, BC, Canada
[4] Univ British Columbia, Inst Oceans & Fisheries, Dept Stat, Vancouver, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Expectation-maximization algorithm; Maximum likelihood estimation; State space model; Statistical ecology; Stochastic gradient descent; ANIMAL MOVEMENT; MIXTURE-MODELS; EM; LIKELIHOOD; ALGORITHMS; STATES;
D O I
10.1080/10618600.2024.2350476
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Hidden Markov models (HMMs) are popular models to identify a finite number of latent states from sequential data. However, fitting them to large datasets can be computationally demanding because most likelihood maximization techniques require iterating through the entire underlying dataset for every parameter update. We propose a novel optimization algorithm that updates the parameters of an HMM without iterating through the entire dataset. Namely, we combine a partial E step with variance-reduced stochastic optimization within the M step. We prove the algorithm converges under certain regularity conditions. We test our algorithm empirically using a simulation study as well as a case study of kinematic data collected using suction-cup attached biologgers from eight northern resident killer whales (Orcinus orca) off the western coast of Canada. In both, our algorithm converges in fewer epochs, with less computation time, and to regions of higher likelihood compared to standard numerical optimization techniques. Our algorithm allows practitioners to fit complicated HMMs to large time-series datasets more efficiently than existing baselines. Supplemental materials are available online.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Stochastic Variance-Reduced Cubic Regularization for Nonconvex Optimization
    Wang, Zhe
    Zhou, Yi
    Liang, Yingbin
    Lan, Guanghui
    [J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [2] MURANA: A Generic Framework for Stochastic Variance-Reduced Optimization
    Condat, Laurent
    Richtarik, Peter
    [J]. MATHEMATICAL AND SCIENTIFIC MACHINE LEARNING, VOL 190, 2022, 190
  • [3] Variance-Reduced Decentralized Stochastic Optimization With Accelerated Convergence
    Xin, Ran
    Khan, Usman A.
    Kar, Soummya
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 6255 - 6271
  • [4] Variance-Reduced and Projection-Free Stochastic Optimization
    Hazan, Elad
    Luo, Haipeng
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [5] Estimate Sequences for Variance-Reduced Stochastic Composite Optimization
    Kulunchakov, Andrei
    Mairal, Julien
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [6] Communication-efficient Variance-reduced Stochastic Gradient Descent
    Ghadikolaei, Hossein S.
    Magnusson, Sindri
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 2648 - 2653
  • [7] Stochastic Variational Inference for Hidden Markov Models
    Foti, Nicholas J.
    Xu, Jason
    Laird, Dillon
    Fox, Emily B.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [8] Stochastic Variance-Reduced Policy Gradient
    Papini, Matteo
    Binaghi, Damiano
    Canonaco, Giuseppe
    Pirotta, Matteo
    Restelli, Marcello
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [9] A DECENTRALIZED VARIANCE-REDUCED METHOD FOR STOCHASTIC OPTIMIZATION OVER DIRECTED GRAPHS
    Qureshi, Muhammad, I
    Xin, Ran
    Kar, Soummya
    Khan, Usman A.
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5030 - 5034
  • [10] Accelerating variance-reduced stochastic gradient methods
    Derek Driggs
    Matthias J. Ehrhardt
    Carola-Bibiane Schönlieb
    [J]. Mathematical Programming, 2022, 191 : 671 - 715