Hidden Conditional Random Fields for Visual Speech Recognition

Cited by: 1
Authors
Pass, Adrian [1]
Zhang, Jianguo [1]
Stewart, Darryl [1]
Affiliations
[1] Queen's University Belfast, School of Electronics, Electrical Engineering and Computer Science, Belfast BT7 1NN, Antrim, Northern Ireland
DOI
10.1109/IMVIP.2009.28
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present the application of Hidden Conditional Random Fields (HCRFs) to modeling speech for visual speech recognition. HCRFs can be easily adapted to model long-range dependencies across an observation sequence. As a result, visual word recognition performance can be improved, because the model can draw on more context when generating state sequences. Results are presented from a speaker-dependent, isolated-digit visual speech recognition task and compared against a baseline HMM system. First, we show that word recognition rates on clean video using HCRFs can be improved by increasing the number of past and future observations taken into account by each state. Second, we compare model performance under various levels of video compression applied to the test set. As far as we are aware, this is the first attempted use of HCRFs for visual speech recognition.
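The abstract does not say how past and future observations are supplied to each state. One common way to give a sequence model a wider view is to concatenate each frame's feature vector with those of its neighbours. The sketch below illustrates that general idea only and is not the authors' implementation: the function name `stack_context`, the parameters `past` and `future`, and the choice to pad sequence edges by repeating the first or last frame are all assumptions made for this example.

```python
from typing import List, Sequence


def stack_context(
    frames: Sequence[Sequence[float]], past: int, future: int
) -> List[List[float]]:
    """Concatenate each frame with `past` preceding and `future` following frames.

    Hypothetical helper for illustration. Indices that fall outside the
    sequence are clamped, so edge frames are repeated as padding (an
    assumption, not something stated in the abstract).
    """
    if not frames:
        return []
    last = len(frames) - 1
    stacked = []
    for t in range(len(frames)):
        window: List[float] = []
        for offset in range(-past, future + 1):
            idx = min(max(t + offset, 0), last)  # clamp to a valid frame index
            window.extend(frames[idx])
        stacked.append(window)
    return stacked


# Toy sequence of three 2-dimensional visual feature vectors.
frames = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1]]
windowed = stack_context(frames, past=1, future=1)
```

With `past=1` and `future=1`, every output vector has three times the original dimensionality. Setting both to zero returns the frames unchanged, which would correspond to a model that sees only the current observation.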
Pages: 117-122
Number of pages: 6