Hidden Conditional Random Fields for Visual Speech Recognition

被引：1

作者：

Pass, Adrian ^{[1
]}

Zhang, Jianguo ^{[1
]}

Stewart, Darryl ^{[1
]}

机构：

[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, Antrim, North Ireland

来源：

2009 13TH INTERNATIONAL MACHINE VISION AND IMAGE PROCESSING CONFERENCE | 2009年

关键词：

D O I：

10.1109/IMVIP.2009.28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present the application of Hidden Conditional Random Fields (HCRFs) to modeling speech for visual speech recognition. HCRFs may be easily adapted to model long range dependencies across an observation sequence. As a result visual word recognition performance can be improved as the model is able to take more of a contextual approach to generating state sequences. Results are presented from a speaker-dependent, isolated digit, visual speech recognition task using comparisons with a baseline HMM system. We firstly illustrate that word recognition rates on clean video using HCRFs can be improved by increasing the number of past and future observations being taken into account by each state. Secondly we compare model performances using various levels of video compression on the test set. As far as we are aware this is the first attempted use of HCRFs for visual speech recognition.

引用

页码：117 / 122

页数：6

共 50 条

[1] Hidden Conditional Random Fields for Phone Recognition
Sung, Yun-Hsuan
Jurafsky, Dan
[J]. 2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 107 - 112
[2] Hidden Conditional Random Fields for Gait Recognition
Hagui, Mabrouka
Mahjoub, Mohamed Ali
[J]. 2016 SECOND INTERNATIONAL IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2016,
[3] Hidden Conditional Random Fields for Face Recognition
Yang, Huachun
[J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND APPLICATIONS (CSA), 2013, : 337 - 340
[4] Hidden Conditional Random Fields for Face Recognition
Yang, Huachun
[J]. INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2012), 2013, 8768
[5] Hidden Conditional Random Fields for Action Recognition
Chen, Lifang
van der Aa, Nico
Tan, Robby T.
Veltkamp, Remco C.
[J]. PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 240 - 247
[6] Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition
Hong, Wei-Tyng
[J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (04) : 729 - 742
[7] Speech Recognition Using Augmented Conditional Random Fields
Hifny, Yasser
Renals, Steve
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (02): : 354 - 365
[8] Hidden Conditional Neural Fields for Continuous Phoneme Speech Recognition
Fujii, Yasuhisa
Yamamoto, Kazumasa
Nakagawa, Seiichi
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (08): : 2094 - 2104
[9] AUTOMATIC SPEECH RECOGNITION USING HIDDEN CONDITIONAL NEURAL FIELDS
Fujii, Yasuhisa
Yamamoto, Kazumasa
Nakagawa, Seiichi
[J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5036 - 5039
[10] Hand Posture Recognition Using Hidden Conditional Random Fields
Liu, Te-Cheng
Wang, Ko-Chih
Tsai, Augustine
Wang, Chieh-Chih
[J]. 2009 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, VOLS 1-3, 2009, : 1817 - +

← 1 2 3 4 5 →