ON THE USE OF SPEAKER SUPERFACTORS FOR SPEAKER RECOGNITION

Authors
Scheffer, Nicolas [1]
Vogt, Robbie [2]
Affiliations
[1] SRI International, Menlo Park, CA 94025, USA
[2] Queensland University of Technology, Brisbane, Qld, Australia
Funding
Australian Research Council
Keywords
speaker recognition
DOI
10.1109/ICASSP.2010.5495631
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose a new method to characterize a speaker within the Joint Factor Analysis (JFA) framework. Scoring within the JFA framework can be costly and a new method was proposed to produce an accurate score in a fast manner. However, this method is nonsymmetric and performs badly without any score normalization. We propose a new JFA scoring method that is both symmetrical and efficient. In the same way as means of Gaussians can be concatenated to form a supervector, we use several estimates of speaker factors from the eigenvoice space to build a supervector of factors that we call superfactors. We motivate the use of such factors in the current JFA model through comparison with a Tied Factor Analysis model. We show that this method substantially improves the performance of a system that uses only the standard speaker factors to produce scores, and usually outperforms the baseline system. We also show that this method is relatively effective even when score normalization is not an option.
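The supervector analogy in the abstract can be illustrated with a minimal sketch. It assumes that several speaker-factor estimates are already available for each utterance (the abstract does not say how they are obtained) and simply concatenates them into one vector. The cosine similarity used for scoring is an illustrative stand-in chosen because it is symmetric; it is not taken from the paper, and the names `build_superfactor` and `cosine_score` and the toy values are hypothetical.

```python
import math


def build_superfactor(factor_estimates):
    """Concatenate several speaker-factor estimates into one flat vector."""
    superfactor = []
    for estimate in factor_estimates:
        superfactor.extend(estimate)
    return superfactor


def cosine_score(a, b):
    """Symmetric similarity (an assumption, not the paper's scoring rule)."""
    if len(a) != len(b):
        raise ValueError("superfactors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy data: three 2-dimensional speaker-factor estimates
# per utterance, standing in for real eigenvoice-space estimates.
enroll_estimates = [[0.5, -1.0], [0.25, 0.75], [1.0, 0.0]]
test_estimates = [[0.4, -0.9], [0.3, 0.7], [0.9, 0.1]]

enroll_sf = build_superfactor(enroll_estimates)
test_sf = build_superfactor(test_estimates)
score = cosine_score(enroll_sf, test_sf)
```

Swapping the enrollment and test vectors leaves `score` unchanged, which is the symmetry property the abstract asks of a scoring method.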
Pages: 4410-4413
Number of pages: 4