Lip temporal pattern analysis for automatic visual speech recognition

被引：0

作者：

Xie, L ^{[1
]}

Cai, XL ^{[1
]}

Fu, ZH ^{[1
]}

Jiang, DM ^{[1
]}

Zhao, RC ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China

来源：

2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3 | 2004年

关键词：

visual speech recognition; lipreading; feature extraction; lip temporal pattern;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. The long-time Lip TenipoRA1 Patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes when uttering speech. A dynamic visual feature is also proposed based on the LipTRAPs. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature can yield significant WRR improvments than conventional delta features.

引用

页码：703 / 706

页数：4

共 50 条

[21] Automatic speech recognition using audio visual cues
Yashwanth, H
Mahendrakar, H
David, S
PROCEEDINGS OF THE IEEE INDICON 2004, 2004, : 166 - 169
[22] Research on an automated speech pattern recognition system based on lip movement
Zhang, B
Fukui, Y
PROCEEDINGS OF THE 18TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 18, PTS 1-5, 1997, 18 : 1530 - 1531
[23] Acoustic Analysis for Automatic Speech Recognition
O'Shaughnessy, Douglas
PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1038 - 1053
[24] Lip movement synthesis in audio-visual speech recognition system
Li, Junquan
Yin, Yixin
Proc. 2005 IEEE Int. Conf. on Lang. Process. Knowl. Engin. IEEE NLP-KE '05, (461-465):
[25] Lip movement synthesis in audio-visual speech recognition system
Li, JQ
Yin, YX
PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 461 - 465
[26] SPEECH RECOGNITION USING PATTERN ANALYSIS
SOROKIN, VN
ENGINEERING CYBERNETICS, 1966, (05): : 90 - &
[27] Audio-Visual Automatic Speech Recognition Using PZM, MFCC and Statistical Analysis
Debnath, Saswati
Roy, Pinki
INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 7 (02): : 121 - 133
[28] Visual-speech-pass filtering for robust automatic lip-reading
Jong-Seok Lee
Pattern Analysis and Applications, 2014, 17 : 611 - 621
[29] Visual-speech-pass filtering for robust automatic lip-reading
Lee, Jong-Seok
PATTERN ANALYSIS AND APPLICATIONS, 2014, 17 (03) : 611 - 621
[30] An audio-visual corpus for speech perception and automatic speech recognition (L)
Cooke, Martin
Barker, Jon
Cunningham, Stuart
Shao, Xu
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05): : 2421 - 2424

← 1 2 3 4 5 →