Robust audiovisual integration using semicontinuous hidden Markov models

被引:0
|
作者
Su, Q
Silsbee, PL
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We describe an improved method of integrating audio and visual information in a HMM-based audiovisual ASR system. The method uses a modified semicontinuous HMM (SCHMM) for integration and recognition. Our results show substantial improvements over earlier integration methods at high noise levels. Our integration method relies on the assumption that, as environmental conditions deviate from those under which training occurred, the underlying probability distributions will also change. We use phoneme based SCHMMs for classification of isolated words. The probability models underlying the standard SCHMM are Gaussian; thus, low probability estimates will tend to be associated with high confidences (small differences in the feature values cause large proportional differences in probabilities, when the values are in the tail of the distribution). Therefore, during classification, we replace each Gaussian with a scoring function which looks Gaussian mar the mean of the distribution but has a heavier tail. We report results comparing this method with an audio-only system and with previous integration methods. At high noise levels, the system with modified scoring functions shows a better than 50recognition does suffer when noise is low. Methods which can adjust the relative weight of the audio and visual information can still potentially outperform the new method, provided that a reliable way of choosing those weights can be found.
引用
收藏
页码:42 / 45
页数:4
相关论文
共 50 条
  • [21] Classic cryptanalysis using hidden Markov models
    Vobbilisetty, Rohit
    Di Troia, Fabio
    Low, Richard M.
    Visaggio, Corrado Aaron
    Stamp, Mark
    CRYPTOLOGIA, 2017, 41 (01) : 1 - 28
  • [22] Classification of chirps using Hidden Markov Models
    Balachandran, Nikhil
    Creusere, Charles
    2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 545 - +
  • [23] Cough Detection Using Hidden Markov Models
    Teyhouee, Aydin
    Osgood, Nathaniel D.
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, SBP-BRIMS 2019, 2019, 11549 : 266 - 276
  • [24] Address extraction using hidden Markov models
    Taghva, K
    Coombs, J
    Pereda, R
    Nartker, T
    Document Recognition and Retrieval XII, 2005, 5676 : 119 - 126
  • [25] Phonocardiogram segmentation by using hidden markov models
    Lima, Carlos S.
    Cardoso, Manuel. J.
    PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2007, : 415 - 418
  • [26] Ionogram Scaling using Hidden Markov Models
    Gok, Gokhan
    Alp, Yasar Kemal
    Arikan, Orhan
    Arikan, Feza
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [27] Classification of electrocardiogram using hidden Markov models
    Cheng, WT
    Chan, KL
    PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 20, PTS 1-6: BIOMEDICAL ENGINEERING TOWARDS THE YEAR 2000 AND BEYOND, 1998, 20 : 143 - 146
  • [28] Speaker identification using hidden Markov models
    Inman, M
    Danforth, D
    Hangai, S
    Sato, K
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 609 - 612
  • [29] Image distance using hidden Markov models
    DeMenthon, D
    Doermann, D
    Stückelberg, MV
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 143 - 146
  • [30] A Virtual Director Using Hidden Markov Models
    Merabti, B.
    Christie, M.
    Bouatouch, K.
    COMPUTER GRAPHICS FORUM, 2016, 35 (08) : 51 - 67