Visual Feature Extraction for Isolated Word Visual Only Speech Recognition of Vietnamese

被引：0

作者：

Nguyen Thien Chuong ^{[1
]}

Chaloupka, Josef ^{[1
]}

机构：

[1] Tech Univ Liberec, Inst Informat Technol & Elect, Liberec, Czech Republic

来源：

2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2013年

关键词：

Audio-visual speech recognition; isolated word recognition; LDA; Vietnamese language; visual feature;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents our research on visual feature extraction with some special treatment for dealing with Vietnamese language. The effect of linear discriminant analysis (LDA) when training with different sets of basic class will be examined. For improving the visual features, we proposed two types of visual front end for automatic lip-reading: (a) 1-Stage LDA visual front end; and (b) hierarchical LDA (HLDA) visual front end. We also compare four different types of visual feature on an isolated word visual only speech recognition of Vietnamese task using our recorded audio-visual speech database. Experiments on our database show that the proposed visual front end improves up to 8% of recognition accuracy and the HLDA visual front end outperform the other.

引用

页码：459 / 463

页数：5

共 50 条

[1] Visual speech feature extraction for improved speech recognition
Zhang, X
Mersereau, RM
Clements, M
Broun, CC
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1993 - 1996
[2] Automatic Visual Feature Extraction for Mandarin Audio-Visual Speech Recognition
Pao, Tsang-Long
Liao, Wen-Yuan
Wu, Tsan-Nung
Lin, Ching-Yi
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2936 - 2940
[3] A HYBRID VISUAL FEATURE EXTRACTION METHOD FOR AUDIO-VISUAL SPEECH RECOGNITION
Wu, Guanyong
Zhu, Jie
Xu, Haihua
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1829 - 1832
[4] Study of different feature extraction method for visual speech recognition
Debnath, Saswati
Roy, Pinki
Justin, Vijin
Naik, Shradha
2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
[5] An isolated word speech recognition using fusion of auditory and visual information
Shintani, A
Ogihara, A
Doi, N
Takamatsu, S
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1996, E79A (06) : 777 - 783
[6] A cascade gray-stereo visual feature extraction method for visual and audio-visual speech recognition
Sui, Chao
Togneri, Roberto
Bennamoun, Mohammed
SPEECH COMMUNICATION, 2017, 90 : 26 - 38
[7] Research on Visual Speech Feature Extraction
He Jun
Zhang Hua
2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, : 499 - 502
[8] Visual Speech Recognition: a solution from feature extraction to words classification
Da Silveira, L
Facon, J
Borges, DL
XVI BRAZILIAN SYMPOSIUM ON COMPUTER GRAPHICS AND IMAGE PROCESSING, PROCEEDINGS, 2003, : 399 - 405
[9] Information Theoretic Feature Extraction for Audio-Visual Speech Recognition
Gurban, Mihai
Thiran, Jean-Philippe
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (12) : 4765 - 4776
[10] A robust visual feature extraction based BTSM-LDA for audio-visual speech recognition
Lv, Guoyun
Zhao, Rongchun
Jiang, Dongmei
Li, Yan
Sahli, H.
2007 SECOND INTERNATIONAL CONFERENCE IN COMMUNICATIONS AND NETWORKING IN CHINA, VOLS 1 AND 2, 2007, : 1044 - +

← 1 2 3 4 5 →