Speaker identification using speech and lip features

被引:0
|
作者
Ou, GB [1 ]
Li, X [1 ]
Yao, XC [1 ]
Jia, HB [1 ]
Murphey, YL [1 ]
机构
[1] Univ Michigan, Dept Elect & Comp Engn, Intelligent Syst Lab, Dearborn, MI 48128 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a speaker identification system that uses synchronized speech signals and lip features. We developed an algorithm that automatically extracts lip areas from speaker images, and a neural network system that integrates the two different types of signals to give accurate identification of speakers. We show that the proposed system gives better performances than the systems that use only speech or lip features in both text dependant and text independent speaker identification applications.
引用
收藏
页码:2565 / 2570
页数:6
相关论文
共 50 条
  • [1] Discriminative analysis of lip motion features for speaker identification and speech-reading
    Cetinguel, H. Ertan
    Yemez, Yuecel
    Erzin, Engin
    Tekalp, A. Murat
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (10) : 2879 - 2891
  • [2] Audiovisual Speaker Identification Based on Lip and Speech Modalities
    Chelali, Fatma
    Djeradi, Amar
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (01) : 99 - 110
  • [3] Robust lip-motion features for speaker identification
    Çetingül, HE
    Yemez, Y
    Erzin, E
    Tekalp, AM
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 509 - 512
  • [4] Adaptive fusion of speech and lip information for robust speaker identification
    Wark, T
    Sridharan, S
    [J]. DIGITAL SIGNAL PROCESSING, 2001, 11 (03) : 169 - 186
  • [5] Discriminative lip-motion features for biometric speaker identification
    Cetingül, HE
    Yemez, Y
    Erzin, E
    Tekalp, AM
    [J]. ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2023 - 2026
  • [6] On optimal selection of lip-motion features for speaker identification
    Çetingül, HE
    Erzin, E
    Yemez, Y
    Tekalp, AM
    [J]. 2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 7 - 10
  • [7] Multimodal speaker/speech recognition using lip motion, lip texture and audio
    Cetingul, H. E.
    Erzin, E.
    Yemez, Y.
    Tekalp, A. M.
    [J]. SIGNAL PROCESSING, 2006, 86 (12) : 3549 - 3558
  • [8] Performance enhancement of speaker identification systems using speech encryption and cancelable features
    Soliman N.F.
    Mostfa Z.
    El-Samie F.E.A.
    Abdalla M.I.
    [J]. International Journal of Speech Technology, 2017, 20 (4) : 977 - 1004
  • [9] Speaker Identification using Whispered Speech
    Jawarkar, Naresh P.
    Holambe, Raghunath S.
    Basu, Tapan Kumar
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
  • [10] SPEAKER IDENTIFICATION UTILIZING SELECTED TEMPORAL SPEECH FEATURES
    JOHNSON, CC
    HOLLIEN, H
    HICKS, JW
    [J]. JOURNAL OF PHONETICS, 1984, 12 (04) : 319 - 326