Lip-Based Visual Speech Recognition System

被引:0
|
作者
Frisky, Aufaclav Zatu Kusuma [1 ]
Wang, Chien-Yao [1 ]
Santoso, Andri [1 ]
Wang, Jia-Ching [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
visual speech recognition; kernel sparse representation classifier; non-negative matrix factorization; spatiotemporal descriptor;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper proposes a system to address the problem of visual speech recognition. The proposed system is based on visual lip movement recognition by applying video content analysis technique. Using spatiotemporal features descriptors, we extracted features from video containing visual lip information. A preprocessing step is employed by removing the noise and enhancing the contrast of images in every frames of video. Extracted feature are used to build a dictionary for kernel sparse representation classifier (K-SRC) in the classification step. We adopted non-negative matrix factorization (NMF) method to reduce the dimensionality of the extracted features. We evaluated the performance of our system using AVLetters and AVLetters2 dataset. To evaluate the performance of our system, we used the same configuration as another previous works. Using AVLetters dataset, the promising accuracies of 67.13%, 45.37%, and 63.12% can be achieved in semi speaker dependent, speaker independent, and speaker dependent, respectively. Using AVLetters2 dataset, our method can achieve accuracy rate of 89.02% for speaker dependent case and 25.9% for speaker independent. This result showed that our proposed method outperforms another methods using same configuration.
引用
收藏
页码:315 / 319
页数:5
相关论文
共 50 条
  • [1] Understanding visual lip-based biometric authentication for mobile devices
    Wright, Carrie
    Stewart, Darryl William
    [J]. EURASIP JOURNAL ON INFORMATION SECURITY, 2020, 2020 (01)
  • [2] LANGUAGE IDENTIFICATION AS IMPROVEMENT FOR LIP-BASED BIOMETRIC VISUAL SYSTEMS
    Cascone, Lucia
    Nappi, Michele
    Narducci, Fabio
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1570 - 1574
  • [3] Understanding visual lip-based biometric authentication for mobile devices
    Carrie Wright
    Darryl William Stewart
    [J]. EURASIP Journal on Information Security, 2020
  • [4] One-Shot-Learning for Visual Lip-Based Biometric Authentication
    Wright, Carrie
    Stewart, Darryl
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2019, PT I, 2020, 11844 : 405 - 417
  • [5] Lip movement synthesis in audio-visual speech recognition system
    Li, Junquan
    Yin, Yixin
    [J]. Proc. 2005 IEEE Int. Conf. on Lang. Process. Knowl. Engin. IEEE NLP-KE '05, (461-465):
  • [6] Lip movement synthesis in audio-visual speech recognition system
    Li, JQ
    Yin, YX
    [J]. PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 461 - 465
  • [7] Identification Approach Lip-Based Biometric
    Travieso, Carlos M.
    Briceno, Juan C.
    Alonso, Jesus B.
    [J]. RECENT ADVANCES IN INTELLIGENT ENGINEERING SYSTEMS, 2012, 378 : 341 - 360
  • [8] Lip Tracking Method for the System of Audio-Visual Polish Speech Recognition
    Kubanek, Mariusz
    Bobulski, Janusz
    Adrjanowicz, Lukasz
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2012, 7267 : 535 - 542
  • [9] Improved Lip Contour Extraction For Visual Speech Recognition
    Chalamala, Srinivasa Rao
    Gudla, Balakrishna
    Yegnanarayana, B.
    Sheela, Anitha K.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2015, : 459 - 462
  • [10] Lip location normalized training for visual speech recognition
    Vanegas, O
    Tokuda, K
    Kitamura, T
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (11): : 1969 - 1977