Speaker and digit recognition by audio-visual lip biometrics

被引：0

作者：

Faraj, Maycel Isaac ^{[1
]}

Bigun, Josef ^{[1
]}

机构：

[1] Halmstad Univ, Sch Informat Sci Comp & Elect Engn, IDE, Box 823, SE-30118 Halmstad, Sweden

来源：

ADVANCES IN BIOMETRICS, PROCEEDINGS | 2007年 / 4642卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a new robust bi-modal audio visual digit and speaker recognition system by lip-motion and speech biometrics. To increase the robustness of digit and speaker recognition, we have proposed a method using speaker lip motion information extracted from video sequences with low resolution (128 x 128 pixels). In this paper we investigate a biometric system for digit recognition and speaker identification based using line-motion estimation with speech information and Support Vector Machines. The acoustic and visual features are fused at the feature level showing favourable results with digit recognition being 83% to 100% and speaker recognition 100% on the XM2VTS database.

引用

页码：1016 / +

页数：3

共 50 条

[1] Audio-visual biometrics
Aleksic, Petar S.
Katsaggelos, Aggelos K.
[J]. PROCEEDINGS OF THE IEEE, 2006, 94 (11) : 2025 - 2044
[2] Speaker independent audio-visual speech recognition
Zhang, Y
Levinson, S
Huang, T
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1073 - 1076
[3] Multifactor fusion for audio-visual speaker recognition
Chetty, Girija
Tran, Dat
[J]. LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 70 - +
[4] Audio-visual system for robust speaker recognition
Chen, Q
Yang, JG
Gou, J
[J]. MLMTA '05: Proceedings of the International Conference on Machine Learning Models Technologies and Applications, 2005, : 97 - 103
[5] Lip biometrics for digit recognition
Faraj, Maycel Isaac
Bigun, Josef
[J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2007, 4673 : 360 - 365
[6] AUDIO-VISUAL ISOLATED DIGIT RECOGNITION FOR WHISPERED SPEECH
Fan, Xing
Busso, Carlos
Hansen, John H. L.
[J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1500 - 1503
[7] Audio-Visual Speech Recognition in the Presence of a Competing Speaker
Shao, Xu
Barker, Jon
[J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1292 - 1295
[8] Dynamic Bayesian Networks for audio-visual speaker recognition
Li, DD
Yang, YC
Wu, ZH
[J]. ADVANCES IN BIOMETRICS, PROCEEDINGS, 2006, 3832 : 539 - 545
[9] Speaker independent audio-visual continuous speech recognition
Liang, LH
Liu, XX
Zhao, YB
Pi, XB
Nefian, AV
[J]. IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : A25 - A28
[10] Audio-visual speaker recognition for video broadcast news
Maison, B
Neti, C
Senior, A
[J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2001, 29 (1-2): : 71 - 79

← 1 2 3 4 5 →