Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese

Cited by: 0

Authors:

Nguyen Thien Chuong ^{[1]}

Chaloupka, Josef ^{[1]}

Affiliations:

[1] Tech Univ Liberec, Inst Informat Technol & Elect, Liberec, Czech Republic

Source:

2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2013

Keywords:

Audio-visual speech recognition; isolated word recognition; LDA; Vietnamese language; visual feature;

DOI:

Not available

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

This paper presents our research on visual feature extraction, with special treatment for the Vietnamese language. We examine the effect of linear discriminant analysis (LDA) when it is trained with different sets of basic classes. To improve the visual features, we propose two types of visual front end for automatic lip-reading: (a) a 1-stage LDA visual front end; and (b) a hierarchical LDA (HLDA) visual front end. We also compare four different types of visual features on an isolated-word, visual-only Vietnamese speech recognition task using our own recorded audio-visual speech database. Experiments on this database show that the proposed visual front ends improve recognition accuracy by up to 8%, and that the HLDA visual front end outperforms the 1-stage LDA front end.
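As a rough illustration of the LDA idea mentioned in the abstract, the sketch below computes a two-class Fisher discriminant direction for toy 2-D feature vectors and projects the data onto it. It is not the paper's front end: the abstract gives no implementation details, so the data, the two-class restriction, and all function names (`fisher_direction`, `project`, and so on) are hypothetical and chosen only to keep the example short and dependency-free.

```python
# Toy two-class Fisher LDA in 2-D, standard library only.
# All data and names here are hypothetical, not taken from the paper.

def mean(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def scatter(vectors, mu):
    """2x2 scatter matrix of one class around its own mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for v in vectors:
        d = [v[0] - mu[0], v[1] - mu[1]]
        for i in range(2):
            for j in range(2):
                s[i][j] += d[i] * d[j]
    return s

def fisher_direction(class_a, class_b):
    """Return w = Sw^{-1} (mu_a - mu_b), the Fisher discriminant direction."""
    mu_a, mu_b = mean(class_a), mean(class_b)
    sa, sb = scatter(class_a, mu_a), scatter(class_b, mu_b)
    # Within-class scatter is the sum of the per-class scatters.
    sw = [[sa[i][j] + sb[i][j] for j in range(2)] for i in range(2)]
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    if det == 0:
        raise ValueError("within-class scatter matrix is singular")
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    diff = [mu_a[0] - mu_b[0], mu_a[1] - mu_b[1]]
    return [inv[0][0] * diff[0] + inv[0][1] * diff[1],
            inv[1][0] * diff[0] + inv[1][1] * diff[1]]

def project(vectors, w):
    """Project each 2-D vector onto direction w (one scalar per vector)."""
    return [v[0] * w[0] + v[1] * w[1] for v in vectors]

# Hypothetical "visual feature" vectors for two classes.
class_a = [[1.0, 2.0], [2.0, 3.0], [3.0, 3.0], [2.0, 1.0]]
class_b = [[6.0, 5.0], [7.0, 8.0], [8.0, 7.0], [7.0, 6.0]]

w = fisher_direction(class_a, class_b)
proj_a = project(class_a, w)
proj_b = project(class_b, w)
```

In a real front end the input would be higher-dimensional visual features with many classes, which calls for the multi-class generalized eigenvalue formulation rather than this closed-form two-class case.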

Pages: 459-463

Page count: 5
