Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese

Cited by: 0
Authors
Nguyen Thien Chuong [1]
Chaloupka, Josef [1]
Affiliations
[1] Tech Univ Liberec, Inst Informat Technol & Elect, Liberec, Czech Republic
Keywords
Audio-visual speech recognition; isolated word recognition; LDA; Vietnamese language; visual feature;
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808; 0809;
Abstract
This paper presents our research on visual feature extraction, with special treatment for the Vietnamese language. We examine the effect of linear discriminant analysis (LDA) when it is trained with different sets of basic classes. To improve the visual features, we propose two types of visual front end for automatic lip-reading: (a) a 1-stage LDA visual front end; and (b) a hierarchical LDA (HLDA) visual front end. We also compare four different types of visual features on an isolated-word, visual-only Vietnamese speech recognition task using our own recorded audio-visual speech database. Experiments on this database show that the proposed visual front ends improve recognition accuracy by up to 8%, and that the HLDA visual front end outperforms the 1-stage LDA front end.
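As a rough illustration of the two front ends named in the abstract, the sketch below applies LDA once to per-frame features (1-stage) and then a second time to stacked neighboring frames (a two-stage, hierarchical arrangement). This is only one common reading of "hierarchical LDA" and is an assumption here; the record does not describe the authors' actual pipeline, class sets, or dimensions. The function names `fit_lda` and `stack_frames`, the synthetic data, and all sizes are hypothetical.

```python
import numpy as np

def fit_lda(X, y, n_components):
    """Fisher LDA: return a (d x n_components) projection matrix that
    maximizes between-class scatter relative to within-class scatter."""
    d = X.shape[1]
    mean_all = X.mean(axis=0)
    Sw = np.zeros((d, d))  # within-class scatter
    Sb = np.zeros((d, d))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)
        diff = (mc - mean_all)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    Sw += 1e-6 * np.eye(d)  # small ridge so Sw is positive definite
    # Whiten with the Cholesky factor of Sw, then solve a symmetric eigenproblem.
    L = np.linalg.cholesky(Sw)
    Linv = np.linalg.inv(L)
    evals, evecs = np.linalg.eigh(Linv @ Sb @ Linv.T)
    order = np.argsort(evals)[::-1][:n_components]
    return Linv.T @ evecs[:, order]

def stack_frames(F, context):
    """Concatenate each frame with `context` neighbors on each side (edge-padded)."""
    T = F.shape[0]
    padded = np.pad(F, ((context, context), (0, 0)), mode="edge")
    return np.hstack([padded[i:i + T] for i in range(2 * context + 1)])

# Synthetic stand-in for per-frame visual features (assumed sizes, not from the paper).
rng = np.random.default_rng(0)
n_classes, per_class, dim = 4, 50, 12
y = np.repeat(np.arange(n_classes), per_class)
centers = rng.normal(scale=3.0, size=(n_classes, dim))
X = centers[y] + rng.normal(size=(y.size, dim))

# Stage 1: intra-frame LDA (this alone corresponds to a 1-stage front end).
W1 = fit_lda(X, y, n_components=3)
Z1 = X @ W1

# Stage 2: LDA over stacked neighboring frames to capture temporal context.
S = stack_frames(Z1, context=2)
W2 = fit_lda(S, y, n_components=3)
Z2 = S @ W2
```

With `C` classes, LDA yields at most `C - 1` useful directions, which is why the choice of basic class set mentioned in the abstract directly limits the dimensionality of the resulting features.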
Pages: 459 - 463
Page count: 5
Related papers
50 in total
  • [31] Hemispheric asymmetries in feature integration during visual word recognition
    Lindell, Annukka K.
    Arend, Isabel
    Ward, Robert
    Norton, Jennifer
    Wathan, Jennifer
    [J]. LATERALITY, 2007, 12 (06) : 543 - 558
  • [32] WORD TONE RECOGNITION IN VIETNAMESE WHISPERED SPEECH
    MILLER, JD
    [J]. WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1961, 17 (01) : 11 - 15
  • [33] Comparative Study of Visual Feature for Bimodal Hindi Speech Recognition
    Upadhyaya, Prashant
    Farooq, Omar
    Abidi, M. R.
    Varshney, Priyanka
    [J]. ARCHIVES OF ACOUSTICS, 2015, 40 (04) : 609 - 619
  • [34] Speech Recognition System Based on Visual Feature for the Hearing Impaired
    Wang, Xu
    Han, Zhiyan
    Wang, Jian
    Gu, Mingtao
    [J]. ICNC 2008: FOURTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 2, PROCEEDINGS, 2008, : 543 - +
  • [35] Relevant feature selection for audio-visual speech recognition
    Drugman, Thomas
    Gurban, Mihai
    Thiran, Jean-Philippe
    [J]. 2007 IEEE NINTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2007, : 179 - +
  • [36] Audio-Visual Speech Recognition Based on AAM Parameter and Phoneme Analysis of Visual Feature
    Komai, Yuto
    Ariki, Yasuo
    Takiguchi, Tetsuya
    [J]. ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PT I, 2011, 7087 : 97 - 108
  • [37] Hyper column model vs. fast DCT for feature extraction in visual Arabic speech recognition
    Sagheer, A
    Tsuruta, N
    Taniguchi, R
    Maeda, S
    [J]. 2005 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), VOLS 1 AND 2, 2005, : 761 - 766
  • [38] Development of Visual-Only Speech Recognition System for Mute People
    G. Aswanth Kumar
    Jino Hans William
    [J]. Circuits, Systems, and Signal Processing, 2022, 41 : 2152 - 2172
  • [39] Development of Visual-Only Speech Recognition System for Mute People
    Kumar, G. Aswanth
    William, Jino Hans
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (04) : 2152 - 2172
  • [40] Visual Acuity Test for Isolated Words using Speech Recognition
    Khan, Saud
    Ullah, Khalil
    [J]. 2017 INTERNATIONAL CONFERENCE ON INNOVATIONS IN ELECTRICAL ENGINEERING AND COMPUTATIONAL TECHNOLOGIES (ICIEECT), 2017,