Robust audiovisual integration using semicontinuous hidden Markov models

被引:0
|
作者
Su, Q
Silsbee, PL
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We describe an improved method of integrating audio and visual information in a HMM-based audiovisual ASR system. The method uses a modified semicontinuous HMM (SCHMM) for integration and recognition. Our results show substantial improvements over earlier integration methods at high noise levels. Our integration method relies on the assumption that, as environmental conditions deviate from those under which training occurred, the underlying probability distributions will also change. We use phoneme based SCHMMs for classification of isolated words. The probability models underlying the standard SCHMM are Gaussian; thus, low probability estimates will tend to be associated with high confidences (small differences in the feature values cause large proportional differences in probabilities, when the values are in the tail of the distribution). Therefore, during classification, we replace each Gaussian with a scoring function which looks Gaussian mar the mean of the distribution but has a heavier tail. We report results comparing this method with an audio-only system and with previous integration methods. At high noise levels, the system with modified scoring functions shows a better than 50recognition does suffer when noise is low. Methods which can adjust the relative weight of the audio and visual information can still potentially outperform the new method, provided that a reliable way of choosing those weights can be found.
引用
收藏
页码:42 / 45
页数:4
相关论文
共 50 条
  • [41] Robust facial expression recognition system based on hidden Markov models
    Elgarrai Z.
    Meslouhi O.E.
    Kardouchi M.
    Allali H.
    International Journal of Multimedia Information Retrieval, 2016, 5 (4) : 229 - 236
  • [42] Coupled hidden markov models for robust EO/IR target tracking
    Gai, Jiading
    Li, Yong
    Stevenson, Robert L.
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 41 - 44
  • [43] Robust classification of multivariate time series by imprecise hidden Markov models
    Antonucci, Alessandro
    De Rosa, Rocco
    Giusti, Alessandro
    Cuzzolin, Fabio
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2015, 56 : 249 - 263
  • [44] Robust fitting of hidden Markov regression models under a longitudinal setting
    Maruotti, Antonello
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2014, 84 (08) : 1728 - 1747
  • [45] Denoising and recognition using hidden Markov models with observation distributions modeled by hidden Markov trees
    Milone, Diego H.
    Di Persia, Leandro E.
    Torres, Maria E.
    PATTERN RECOGNITION, 2010, 43 (04) : 1577 - 1589
  • [46] Integration of Auxiliary Features in Hidden Markov Models for Arabic Speech Recognition
    Amrous, Anissa Imen
    Debyeche, Mohamed
    Amrouche, A.
    2009 3RD INTERNATIONAL CONFERENCE ON SIGNALS, CIRCUITS AND SYSTEMS (SCS 2009), 2009, : 612 - 616
  • [47] Feature Selection for Hidden Markov Models and Hidden Semi-Markov Models
    Adams, Stephen
    Beling, Peter A.
    Cogill, Randy
    IEEE ACCESS, 2016, 4 : 1642 - 1657
  • [48] Locating hidden groups in communication networks using hidden Markov models
    Magdon-Ismail, M
    Goldberg, M
    Wallace, W
    Siebecker, D
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2003, 2665 : 126 - 137
  • [49] Markov models — training and evaluation of hidden Markov models
    Jasleen K. Grewal
    Martin Krzywinski
    Naomi Altman
    Nature Methods, 2020, 17 : 121 - 122
  • [50] Markov models - training and evaluation of hidden Markov models
    Grewal, Jasleen K.
    Krzywinski, Martin
    Altman, Naomi
    NATURE METHODS, 2020, 17 (02) : 121 - 122