Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese

被引:0
|
作者
Nguyen Thien Chuong [1 ]
Chaloupka, Josef [1 ]
机构
[1] Tech Univ Liberec, Inst Informat Technol & Elect, Liberec, Czech Republic
关键词
Audio-visual speech recognition; isolated word recognition; LDA; Vietnamese language; visual feature;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents our research on visual feature extraction with some special treatment for dealing with Vietnamese language. The effect of linear discriminant analysis (LDA) when training with different sets of basic class will be examined. For improving the visual features, we proposed two types of visual front end for automatic lip-reading: (a) 1-Stage LDA visual front end; and (b) hierarchical LDA (HLDA) visual front end. We also compare four different types of visual feature on an isolated word visual only speech recognition of Vietnamese task using our recorded audio-visual speech database. Experiments on our database show that the proposed visual front end improves up to 8% of recognition accuracy and the HLDA visual front end outperform the other.
引用
收藏
页码:459 / 463
页数:5
相关论文
共 50 条
  • [1] Visual speech feature extraction for improved speech recognition
    Zhang, X
    Mersereau, RM
    Clements, M
    Broun, CC
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1993 - 1996
  • [2] Automatic Visual Feature Extraction for Mandarin Audio-Visual Speech Recognition
    Pao, Tsang-Long
    Liao, Wen-Yuan
    Wu, Tsan-Nung
    Lin, Ching-Yi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2936 - 2940
  • [3] A HYBRID VISUAL FEATURE EXTRACTION METHOD FOR AUDIO-VISUAL SPEECH RECOGNITION
    Wu, Guanyong
    Zhu, Jie
    Xu, Haihua
    [J]. 2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1829 - 1832
  • [4] Study of different feature extraction method for visual speech recognition
    Debnath, Saswati
    Roy, Pinki
    Justin, Vijin
    Naik, Shradha
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [5] An isolated word speech recognition using fusion of auditory and visual information
    Shintani, A
    Ogihara, A
    Doi, N
    Takamatsu, S
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1996, E79A (06) : 777 - 783
  • [6] A cascade gray-stereo visual feature extraction method for visual and audio-visual speech recognition
    Sui, Chao
    Togneri, Roberto
    Bennamoun, Mohammed
    [J]. SPEECH COMMUNICATION, 2017, 90 : 26 - 38
  • [7] Research on Visual Speech Feature Extraction
    He Jun
    Zhang Hua
    [J]. 2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 499 - 502
  • [8] Visual Speech Recognition: a solution from feature extraction to words classification
    Da Silveira, L
    Facon, J
    Borges, DL
    [J]. XVI BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2003, : 399 - 405
  • [9] Information Theoretic Feature Extraction for Audio-Visual Speech Recognition
    Gurban, Mihai
    Thiran, Jean-Philippe
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (12) : 4765 - 4776
  • [10] A robust visual feature extraction based BTSM-LDA for audio-visual speech recognition
    Lv, Guoyun
    Zhao, Rongchun
    Jiang, Dongmei
    Li, Yan
    Sahli, H.
    [J]. 2007 SECOND INTERNATIONAL CONFERENCE IN COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1 AND 2, 2007, : 1044 - +