Speaker verification using excitation source information

被引:0
|
作者
Pati, Debadatta [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
Speaker-specific excitation information; Subsegmental; Segmental; Suprasegmental; LP residual; LF model;
D O I
10.1007/s10772-012-9137-5
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this work we develop a speaker recognition system based on the excitation source information and demonstrate its significance by comparing with the vocal tract information based system. The speaker-specific excitation information is extracted by the subsegmental, segmental and suprasegmental processing of the LP residual. The speaker-specific information from each level is modeled independently using Gaussian mixture modeling-universal background model (GMM-UBM) modeling and then combined at the score level. The significance of the proposed speaker recognition system is demonstrated by conducting speaker verification experiments on the NIST-03 database. Two different tests, namely, Clean test and Noisy test are conducted. In case of Clean test, the test speech signal is used as it is for verification. In case of Noisy test, the test speech is corrupted by factory noise (9 dB) and then used for verification. Even though for Clean test case, the proposed source based speaker recognition system still provides relatively poor performance than the vocal tract information, its performance is better for Noisy test case. Finally, for both clean and noisy cases, by providing different and robust speaker-specific evidences, the proposed system helps the vocal tract system to further improve the overall performance.
引用
收藏
页码:241 / 257
页数:17
相关论文
共 50 条
  • [21] DNN BASED SPEAKER EMBEDDING USING CONTENT INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Dey, Subhadeep
    Koshinaka, Takafumi
    Motlicek, Petr
    Madikeri, Srikanth
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5344 - 5348
  • [22] Speaker change detection in casual conversations using excitation source features
    Dhananjaya, N.
    Yegnanarayana, B.
    SPEECH COMMUNICATION, 2008, 50 (02) : 153 - 161
  • [23] A new cohort normalization using local acoustic information for speaker verification
    Isobe, T
    Takahashi, J
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 841 - 844
  • [24] Speech enhancement using excitation source information
    Yegnanarayana, B
    Prasanna, SRM
    Rao, KS
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 541 - 544
  • [25] Speaker verification using signatures
    Chatelain, P
    ELECTRONICS LETTERS, 1998, 34 (15) : 1472 - 1473
  • [26] SPEAKER VERIFICATION USING PASSWORDS
    HELMS, RE
    DODDINGTON, GR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 59 : S96 - S96
  • [27] Speaker Recognition from Excitation Source Perspective
    Pati, Debadatta
    Prasanna, S. R. Mahadeva
    IETE TECHNICAL REVIEW, 2010, 27 (02) : 138 - 157
  • [28] Restoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification
    Zhang, Ce
    Zheng, Rong
    Xu, Bo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 132 - 135
  • [29] Mutual Information Adaptive Estimation for Speaker Verification
    Chen C.
    Ji C.
    Li W.
    Chen D.
    Wang L.
    Yang H.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2023, 52 (01): : 125 - 131
  • [30] A comparative study of explicit and implicit modelling of subsegmental speaker-specific excitation source information
    DEBADATTA PATI
    S R MAHADEVA PRASANNA
    Sadhana, 2013, 38 : 591 - 620