Speaker verification using excitation source information

被引：0

作者：

Debadatta Pati

S. R. Mahadeva Prasanna

机构：

[1] Indian Institute of Technology Guwahati,Department of Electronics and Electrical Engineering

来源：

International Journal of Speech Technology | 2012年 / 15卷 / 2期

关键词：

Speaker-specific excitation information; Subsegmental; Segmental; Suprasegmental; LP residual; LF model;

D O I：

10.1007/s10772-012-9137-5

中图分类号：

学科分类号：

摘要：

In this work we develop a speaker recognition system based on the excitation source information and demonstrate its significance by comparing with the vocal tract information based system. The speaker-specific excitation information is extracted by the subsegmental, segmental and suprasegmental processing of the LP residual. The speaker-specific information from each level is modeled independently using Gaussian mixture modeling—universal background model (GMM-UBM) modeling and then combined at the score level. The significance of the proposed speaker recognition system is demonstrated by conducting speaker verification experiments on the NIST-03 database. Two different tests, namely, Clean test and Noisy test are conducted. In case of Clean test, the test speech signal is used as it is for verification. In case of Noisy test, the test speech is corrupted by factory noise (9 dB) and then used for verification. Even though for Clean test case, the proposed source based speaker recognition system still provides relatively poor performance than the vocal tract information, its performance is better for Noisy test case. Finally, for both clean and noisy cases, by providing different and robust speaker-specific evidences, the proposed system helps the vocal tract system to further improve the overall performance.

引用

页码：241 / 257

页数：16

共 50 条

[1] Speaker verification using excitation source information
Pati, Debadatta
Prasanna, S. R. Mahadeva
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) : 241 - 257
[2] Significance of excitation source sequence information for Speaker Verification
Agarwal, Ayush
Mishra, Jagabandhu
Prasanna, S. R. Mahadeva
2022 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM, 2022,
[3] Speaker localization using excitation source information in speech
Raykar, VC
Yegnanarayana, B
Prasanna, SRM
Duraiswami, R
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 751 - 761
[4] Speaker verification using complementary information from vocal source and vocal tract
Zheng, Nengheng
Wang, Ning
Lee, Tan
Ching, P. C.
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 518 - +
[5] Speaker Change Detection using Excitation Source and Vocal Tract System Information
Sarma, Mousmita
Gadre, Sree Nilendra
Sarma, Biswajit Dev
Prasanna, S. R. Mahadeva
2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
[6] Speaker Recognition using Excitation Source Parameters
Kamarauskas, J.
Salna, B.
ELEKTRONIKA IR ELEKTROTECHNIKA, 2011, (01) : 55 - 58
[7] Speaker verification using verbal information verification for automatic enrollment
Li, Q
Juang, BH
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 133 - 136
[8] Different Aspects of Source Information for Limited Data Speaker Verification
Das, Rohan Kumar
Pati, Debadatta
Prasanna, S. R. Mahadeva
2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
[9] Using nonstandard SVM for combination of speaker verification and verbal information verification in speaker authentication system
Liu, Y
Ding, P
Xu, B
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 673 - 676
[10] A Discriminative Method for Speaker Verification Using the Difference Information
Lei, Zhenchun
Yang, Yingchun
Wu, Zhaohui
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 497 - 500

← 1 2 3 4 5 →