Speaker identification using speech and lip features

被引：0

作者：

Ou, GB ^{[1
]}

Li, X ^{[1
]}

Yao, XC ^{[1
]}

Jia, HB ^{[1
]}

Murphey, YL ^{[1
]}

机构：

[1] Univ Michigan, Dept Elect & Comp Engn, Intelligent Syst Lab, Dearborn, MI 48128 USA

来源：

PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5 | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a speaker identification system that uses synchronized speech signals and lip features. We developed an algorithm that automatically extracts lip areas from speaker images, and a neural network system that integrates the two different types of signals to give accurate identification of speakers. We show that the proposed system gives better performances than the systems that use only speech or lip features in both text dependant and text independent speaker identification applications.

引用

页码：2565 / 2570

页数：6

共 50 条

[1] Discriminative analysis of lip motion features for speaker identification and speech-reading
Cetinguel, H. Ertan
Yemez, Yuecel
Erzin, Engin
Tekalp, A. Murat
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (10) : 2879 - 2891
[2] Audiovisual Speaker Identification Based on Lip and Speech Modalities
Chelali, Fatma
Djeradi, Amar
[J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (01) : 99 - 110
[3] Robust lip-motion features for speaker identification
Çetingül, HE
Yemez, Y
Erzin, E
Tekalp, AM
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 509 - 512
[4] Adaptive fusion of speech and lip information for robust speaker identification
Wark, T
Sridharan, S
[J]. DIGITAL SIGNAL PROCESSING, 2001, 11 (03) : 169 - 186
[5] Discriminative lip-motion features for biometric speaker identification
Cetingül, HE
Yemez, Y
Erzin, E
Tekalp, AM
[J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2023 - 2026
[6] On optimal selection of lip-motion features for speaker identification
Çetingül, HE
Erzin, E
Yemez, Y
Tekalp, AM
[J]. 2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 7 - 10
[7] Multimodal speaker/speech recognition using lip motion, lip texture and audio
Cetingul, H. E.
Erzin, E.
Yemez, Y.
Tekalp, A. M.
[J]. SIGNAL PROCESSING, 2006, 86 (12) : 3549 - 3558
[8] Performance enhancement of speaker identification systems using speech encryption and cancelable features
Soliman N.F.
Mostfa Z.
El-Samie F.E.A.
Abdalla M.I.
[J]. International Journal of Speech Technology, 2017, 20 (4) : 977 - 1004
[9] Speaker Identification using Whispered Speech
Jawarkar, Naresh P.
Holambe, Raghunath S.
Basu, Tapan Kumar
[J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
[10] SPEAKER IDENTIFICATION UTILIZING SELECTED TEMPORAL SPEECH FEATURES
JOHNSON, CC
HOLLIEN, H
HICKS, JW
[J]. JOURNAL OF PHONETICS, 1984, 12 (04) : 319 - 326

← 1 2 3 4 5 →