ON THE USE OF SPEAKER SUPERFACTORS FOR SPEAKER RECOGNITION

Cited by: 0

Authors:

Scheffer, Nicolas [1]

Vogt, Robbie [2]

Affiliations:

[1] SRI Int, Menlo Pk, CA 94025 USA

[2] Queensland Univ Technol, Brisbane, Qld, Australia

Source:

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010

Funding:

Australian Research Council;

Keywords:

speaker recognition;

DOI:

10.1109/ICASSP.2010.5495631

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

We propose a new method to characterize a speaker within the Joint Factor Analysis (JFA) framework. Scoring within the JFA framework can be costly and a new method was proposed to produce an accurate score in a fast manner. However, this method is nonsymmetric and performs badly without any score normalization. We propose a new JFA scoring method that is both symmetrical and efficient. In the same way as means of Gaussians can be concatenated to form a supervector, we use several estimates of speaker factors from the eigenvoice space to build a supervector of factors that we call superfactors. We motivate the use of such factors in the current JFA model through comparison with a Tied Factor Analysis model. We show that this method substantially improves the performance of a system that uses only the standard speaker factors to produce scores, and usually outperforms the baseline system. We also show that this method is relatively effective even when score normalization is not an option.
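The concatenation idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the several speaker-factor estimates are obtained or how superfactors are scored, so everything below is an assumption made for illustration: the toy two-dimensional factor estimates, the hypothetical helper names `build_superfactor` and `cosine_score`, and the choice of cosine similarity as one example of a symmetric score.

```python
import math
from typing import List, Sequence


def build_superfactor(factor_estimates: Sequence[Sequence[float]]) -> List[float]:
    """Concatenate several speaker-factor estimates into one long vector.

    This mirrors how Gaussian means are stacked into a supervector.
    How the individual estimates are produced is not covered here.
    """
    superfactor: List[float] = []
    for estimate in factor_estimates:
        superfactor.extend(estimate)
    return superfactor


def cosine_score(a: Sequence[float], b: Sequence[float]) -> float:
    """Symmetric similarity between two superfactors (assumed scoring rule)."""
    if len(a) != len(b):
        raise ValueError("superfactors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Toy data: three 2-dimensional factor estimates per utterance (made up).
enroll_estimates = [[0.5, -1.0], [0.4, -0.9], [0.6, -1.1]]
test_estimates = [[0.45, -0.95], [0.5, -1.0], [0.55, -1.05]]

enroll_sf = build_superfactor(enroll_estimates)
test_sf = build_superfactor(test_estimates)
score = cosine_score(enroll_sf, test_sf)
```

Swapping the enrollment and test arguments leaves the score unchanged, which is the symmetry property the abstract asks of the scoring method; the actual scoring used in the paper may differ.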

Citation:

Pages: 4410-4413

Page count: 4
