Automatic Speechreading with Applications to Human-Computer Interfaces

被引:0
|
作者
Xiaozheng Zhang
Charles C. Broun
Russell M. Mersereau
Mark A. Clements
机构
[1] Georgia Institute of Technology,Center for Signal and Image Processing
[2] Motorola Human Interface Lab,undefined
关键词
automatic speechreading; visual feature extraction; Markov random fields; hidden Markov models; polynomial classifier; speech recognition; speaker verification;
D O I
暂无
中图分类号
学科分类号
摘要
There has been growing interest in introducing speech as a new modality into the human-computer interface (HCI). Motivated by the multimodal nature of speech, the visual component is considered to yield information that is not always present in the acoustic signal and enables improved system performance over acoustic-only methods, especially in noisy environments. In this paper, we investigate the usefulness of visual speech information in HCI related applications. We first introduce a new algorithm for automatically locating the mouth region by using color and motion information and segmenting the lip region by making use of both color and edge information based on Markov random fields. We then derive a relevant set of visual speech parameters and incorporate them into a recognition engine. We present various visual feature performance comparisons to explore their impact on the recognition accuracy, including the lip inner contour and the visibility of the tongue and teeth. By using a common visual feature set, we demonstrate two applications that exploit speechreading in a joint audio-visual speech signal processing task: speech recognition and speaker verification. The experimental results based on two databases demonstrate that the visual information is highly effective for improving recognition performance over a variety of acoustic noise levels.
引用
收藏
相关论文
共 50 条
  • [41] Revolutionizing human-computer interfaces-the auditory perspective
    Patel, Neel S.
    Hughes, Darin E.
    Interactions, 2012, 19 (01) : 34 - 37
  • [42] A new approach to perceptual assessment of human-computer interfaces
    Alessandro Rizzi
    Daniela Fogli
    Barbara Rita Barricelli
    Multimedia Tools and Applications, 2017, 76 : 7381 - 7399
  • [43] Impact of familiarity on information complexity in human-computer interfaces
    Bakaev, Maxim
    2016 INTERNATIONAL CONFERENCE ON MEASUREMENT INSTRUMENTATION AND ELECTRONICS (ICMIE 2016), 2016, 75
  • [44] INVESTIGATING THE GRANULARITY OF THE UNDO FUNCTION IN HUMAN-COMPUTER INTERFACES
    LENMAN, S
    ROBERT, JM
    APPLIED PSYCHOLOGY-AN INTERNATIONAL REVIEW-PSYCHOLOGIE APPLIQUEE-REVUE INTERNATIONALE, 1994, 43 (04): : 543 - 564
  • [45] Considerations in Designing Human-Computer Interfaces for Elderly People
    Williams, Drew
    Ul Alam, Mohammad Arif
    Ahamed, Sheikh Iqbal
    Chu, William
    2013 13TH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE (QSIC), 2013, : 372 - 377
  • [46] Brain Computer Interfaces as Intelligent Sensors for Enhancing Human-Computer Interaction
    Poel, Mannes
    Nijboer, Femke
    van den Broek, Egon L.
    Fairclough, Stephen
    Nijholt, Anton
    ICMI '12: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2012, : 379 - 382
  • [47] Hand-gesture and facial-expression human-computer interfaces for intelligent space applications
    Chen, Qing
    Cordea, Marius D.
    Petriu, Emil M.
    Whalen, Thomas E.
    Rudas, Intre J.
    Varkonyi-Koczy, Annamaria
    2008 IEEE INTERNATIONAL WORKSHOP ON MEDICAL MEASUREMENTS AND APPLICATIONS, 2008, : 1 - +
  • [48] Applications of airborne ultrasound in human-computer interaction
    Dahl, Tobias
    Ealo, Joao L.
    Bang, Hans J.
    Holm, Sverre
    Khuri-Yakub, Pierre
    ULTRASONICS, 2014, 54 (07) : 1912 - 1921
  • [49] Human-computer interaction in rapid prototyping applications
    Popescu, D.
    Annals of DAAAM for 2003 & Proceedings of the 14th International DAAAM Symposium: INTELLIGENT MANUFACTURING & AUTOMATION: FOCUS ON RECONSTRUCTION AND DEVELOPMENT, 2003, : 371 - 372
  • [50] Petri Nets Context Modeling for the Pervasive Human-Computer Interfaces
    Riahi, Ines
    Moussa, Faouzi
    Riahi, Meriem
    MODELING AND USING CONTEXT, CONTEXT 2013, 2013, 8175 : 316 - 329