Speaker verification using excitation source information

Cited by: 0
Authors
Debadatta Pati
S. R. Mahadeva Prasanna
Affiliation
[1] Indian Institute of Technology Guwahati, Department of Electronics and Electrical Engineering
Keywords
Speaker-specific excitation information; Subsegmental; Segmental; Suprasegmental; LP residual; LF model
DOI
10.1007/s10772-012-9137-5
Abstract
This work develops a speaker recognition system based on excitation source information and demonstrates its significance by comparing it with a system based on vocal tract information. The speaker-specific excitation information is extracted by subsegmental, segmental and suprasegmental processing of the linear prediction (LP) residual. The speaker-specific information from each level is modeled independently using the Gaussian mixture model-universal background model (GMM-UBM) approach, and the three levels are then combined at the score level. The significance of the proposed system is demonstrated through speaker verification experiments on the NIST-03 database. Two tests are conducted: a Clean test and a Noisy test. In the Clean test, the test speech signal is used as is for verification. In the Noisy test, the test speech is corrupted by factory noise (9 dB) before verification. In the Clean test, the proposed source-based system still performs worse than the vocal tract based system, but it performs better in the Noisy test. Finally, in both the clean and noisy cases, the proposed system supplies different and robust speaker-specific evidence that helps the vocal tract system further improve overall performance.
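The processing steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the autocorrelation method of LP analysis with an illustrative order of 10, reads the "9 dB" figure as a signal-to-noise ratio, and uses a plain weighted average for score-level combination. All function names and default values are hypothetical. The subsegmental, segmental and suprasegmental feature extraction and the GMM-UBM modeling are not shown.

```python
import math


def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations; returns a with a[0] = 1 (inverse filter)."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # degenerate frame (e.g. all zeros): keep remaining taps at 0
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def lp_residual(frame, order=10):
    """Inverse-filter a frame with its own LP coefficients (order is assumed)."""
    a = levinson_durbin(autocorrelation(frame, order), order)
    return [sum(a[j] * frame[n - j] for j in range(order + 1) if n - j >= 0)
            for n in range(len(frame))]


def add_noise_at_snr(speech, noise, snr_db):
    """Scale noise so the mixture has the requested SNR (assumed meaning of 9 dB)."""
    p_speech = sum(x * x for x in speech) / len(speech)
    p_noise = sum(x * x for x in noise) / len(noise)
    gain = math.sqrt(p_speech / (p_noise * 10.0 ** (snr_db / 10.0)))
    return [s + gain * v for s, v in zip(speech, noise)]


def fuse_scores(scores, weights):
    """Score-level combination as a normalized weighted sum (assumed rule)."""
    return sum(w * s for w, s in zip(weights, scores)) / sum(weights)
```

In this sketch, `lp_residual` would be applied frame by frame to the speech signal, and `fuse_scores` would take one verification score per processing level, for example `fuse_scores([s_sub, s_seg, s_supra], [1.0, 1.0, 1.0])` with hypothetical score variables.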
Pages: 241-257
Page count: 16
Related papers
50 in total
  • [1] Speaker verification using excitation source information
    Pati, Debadatta
    Prasanna, S. R. Mahadeva
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 241 - 257
  • [2] Significance of excitation source sequence information for Speaker Verification
    Agarwal, Ayush
    Mishra, Jagabandhu
    Prasanna, S. R. Mahadeva
    2022 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM, 2022,
  • [3] Speaker localization using excitation source information in speech
    Raykar, VC
    Yegnanarayana, B
    Prasanna, SRM
    Duraiswami, R
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 751 - 761
  • [4] Speaker verification using complementary information from vocal source and vocal tract
    Zheng, Nengheng
    Wang, Ning
    Lee, Tan
    Ching, P. C.
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 518 - +
  • [5] Speaker Change Detection using Excitation Source and Vocal Tract System Information
    Sarma, Mousmita
    Gadre, Sree Nilendra
    Sarma, Biswajit Dev
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [6] Speaker Recognition using Excitation Source Parameters
    Kamarauskas, J.
    Salna, B.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2011, (01) : 55 - 58
  • [7] Speaker verification using verbal information verification for automatic enrollment
    Li, Q
    Juang, BH
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 133 - 136
  • [8] Different Aspects of Source Information for Limited Data Speaker Verification
    Das, Rohan Kumar
    Pati, Debadatta
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [9] Using nonstandard SVM for combination of speaker verification and verbal information verification in speaker authentication system
    Liu, Y
    Ding, P
    Xu, B
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 673 - 676
  • [10] A Discriminative Method for Speaker Verification Using the Difference Information
    Lei, Zhenchun
    Yang, Yingchun
    Wu, Zhaohui
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 497 - 500