Inferring Statistically Significant Hidden Markov Models

被引:16
|
作者
Yu, Lu [1 ]
Schwier, Jason M. [1 ]
Craven, Ryan M. [2 ]
Brooks, Richard R. [1 ]
Griffin, Christopher [3 ]
机构
[1] Clemson Univ, Holcombe Dept Elect & Comp Engn, Clemson, SC 29634 USA
[2] USN, Postgrad Sch, Pebble Beach, CA 93953 USA
[3] Penn State Univ, Appl Res Lab, Commun Informat & Nav Off, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Hidden Markov models; model confidence; statistical hypothesis testing; z-test; ANONYMITY;
D O I
10.1109/TKDE.2012.93
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hidden Markov models (HMMs) are used to analyze real-world problems. We consider an approach that constructs minimum entropy HMMs directly from a sequence of observations. If an insufficient amount of observation data is used to generate the HMM, the model will not represent the underlying process. Current methods assume that observations completely represent the underlying process. It is often the case that the training data size is not large enough to adequately capture all statistical dependencies in the system. It is, therefore, important to know the statistical significance level for that the constructed model representing the underlying process, not only the training set. In this paper, we present a method to determine if the observation data and constructed model fully express the underlying process with a given level of statistical significance. We use the statistics of the process to calculate an upper bound on the number of samples required to guarantee that the model has a given level significance. We provide theoretical and experimental results that confirm the utility of this approach. The experiment is conducted on a real private Tor network.
引用
收藏
页码:1548 / 1558
页数:11
相关论文
共 50 条
  • [1] Inferring Adaptive Introgression Using Hidden Markov Models
    Svedberg, Jesper
    Shchur, Vladimir
    Reinman, Solomon
    Nielsen, Rasmus
    Corbett-Detig, Russell
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2021, 38 (05) : 2152 - 2165
  • [2] Inferring statistically significant features from random forests
    Paul, Jerome
    Dupont, Pierre
    [J]. NEUROCOMPUTING, 2015, 150 : 471 - 480
  • [3] A New Method for Inferring Hidden Markov Models from Noisy Time Sequences
    Kelly, David
    Dillingham, Mark
    Hudson, Andrew
    Wiesner, Karoline
    [J]. PLOS ONE, 2012, 7 (01):
  • [4] Markov models - hidden Markov models
    Grewal, Jasleen K.
    Krzywinski, Martin
    Altman, Naomi
    [J]. NATURE METHODS, 2019, 16 (09) : 795 - 796
  • [5] Markov models — hidden Markov models
    Jasleen K. Grewal
    Martin Krzywinski
    Naomi Altman
    [J]. Nature Methods, 2019, 16 : 795 - 796
  • [6] Inferring state sequences for non-linear systems with embedded hidden Markov models
    Neal, RM
    Beal, MJ
    Roweis, ST
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 401 - 408
  • [7] Utilizing gaze behavior for inferring task transitions using abstract hidden markov models
    Pinpin, Lord Kenneth
    Gamarra, Daniel Fernando Tello
    Johansson, Roland
    Laschi, Cecilia
    Dario, Paolo
    [J]. Inteligencia Artificial, 2016, 19 (58) : 1 - 16
  • [8] Transformed hidden Markov models: Estimating mixture models of images and inferring spatial transformations in video sequences
    Jojic, N
    Petrovic, N
    Frey, BJ
    Huang, TS
    [J]. IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, VOL II, 2000, : 26 - 33
  • [9] Classifying Eye Gaze Patterns and Inferring Individual Preferences Using Hidden Markov Models
    Chan, Antoni B.
    Coutrot, Antoine
    [J]. I-PERCEPTION, 2017, 8 : 20 - 21
  • [10] Hidden Markov models
    Eddy, SR
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) : 361 - 365