Variance-Reduced Stochastic Optimization for Efficient Inference of Hidden Markov Models

被引：0

作者：

Sidrow, Evan ^{[1
]}

Heckman, Nancy ^{[1
]}

Bouchard-Cote, Alexandre ^{[1
]}

Fortune, Sarah M. E. ^{[2
]}

Trites, Andrew W. ^{[3
]}

Auger-Methe, Marie ^{[4
]}

机构：

[1] Univ British Columbia, Dept Stat, Vancouver, BC, Canada

[2] Dalhousie Univ, Dept Oceanog, Halifax, NS, Canada

[3] Univ British Columbia, Inst Oceans & Fisheries, Dept Zool, Vancouver, BC, Canada

[4] Univ British Columbia, Inst Oceans & Fisheries, Dept Stat, Vancouver, BC, Canada

来源：

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS | 2024年

基金：

加拿大自然科学与工程研究理事会;

关键词：

Expectation-maximization algorithm; Maximum likelihood estimation; State space model; Statistical ecology; Stochastic gradient descent; ANIMAL MOVEMENT; MIXTURE-MODELS; EM; LIKELIHOOD; ALGORITHMS; STATES;

D O I：

10.1080/10618600.2024.2350476

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Hidden Markov models (HMMs) are popular models to identify a finite number of latent states from sequential data. However, fitting them to large datasets can be computationally demanding because most likelihood maximization techniques require iterating through the entire underlying dataset for every parameter update. We propose a novel optimization algorithm that updates the parameters of an HMM without iterating through the entire dataset. Namely, we combine a partial E step with variance-reduced stochastic optimization within the M step. We prove the algorithm converges under certain regularity conditions. We test our algorithm empirically using a simulation study as well as a case study of kinematic data collected using suction-cup attached biologgers from eight northern resident killer whales (Orcinus orca) off the western coast of Canada. In both, our algorithm converges in fewer epochs, with less computation time, and to regions of higher likelihood compared to standard numerical optimization techniques. Our algorithm allows practitioners to fit complicated HMMs to large time-series datasets more efficiently than existing baselines. Supplemental materials are available online.

引用

页数：17

共 50 条

[1] Stochastic Variance-Reduced Cubic Regularization for Nonconvex Optimization
Wang, Zhe
Zhou, Yi
Liang, Yingbin
Lan, Guanghui
[J]. 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
[2] MURANA: A Generic Framework for Stochastic Variance-Reduced Optimization
Condat, Laurent
Richtarik, Peter
[J]. MATHEMATICAL AND SCIENTIFIC MACHINE LEARNING, VOL 190, 2022, 190
[3] Variance-Reduced Decentralized Stochastic Optimization With Accelerated Convergence
Xin, Ran
Khan, Usman A.
Kar, Soummya
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2020, 68 : 6255 - 6271
[4] Variance-Reduced and Projection-Free Stochastic Optimization
Hazan, Elad
Luo, Haipeng
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
[5] Estimate Sequences for Variance-Reduced Stochastic Composite Optimization
Kulunchakov, Andrei
Mairal, Julien
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
[6] Communication-efficient Variance-reduced Stochastic Gradient Descent
Ghadikolaei, Hossein S.
Magnusson, Sindri
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 2648 - 2653
[7] Stochastic Variational Inference for Hidden Markov Models
Foti, Nicholas J.
Xu, Jason
Laird, Dillon
Fox, Emily B.
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
[8] Stochastic Variance-Reduced Policy Gradient
Papini, Matteo
Binaghi, Damiano
Canonaco, Giuseppe
Pirotta, Matteo
Restelli, Marcello
[J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[9] A DECENTRALIZED VARIANCE-REDUCED METHOD FOR STOCHASTIC OPTIMIZATION OVER DIRECTED GRAPHS
Qureshi, Muhammad, I
Xin, Ran
Kar, Soummya
Khan, Usman A.
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5030 - 5034
[10] Accelerating variance-reduced stochastic gradient methods
Derek Driggs
Matthias J. Ehrhardt
Carola-Bibiane Schönlieb
[J]. Mathematical Programming, 2022, 191 : 671 - 715

← 1 2 3 4 5 →