Unsupervised intra-speaker variability compensation based on Gestalt and model adaptation in speaker verification with telephone speech

被引:2
|
作者
Yoma, Nestor Becerra [1 ]
Garreton, Claudio [1 ]
Molina, Carlos [1 ]
Huenupan, Fernando [1 ]
机构
[1] Univ Chile, Speech Proc & Transmiss Lab, Dept Elect Engn, Santiago, Chile
关键词
Text-dependent speaker verification; Feature compensation; Intra-speaker variability; Unsupervised model adaptation; Gestalt; Telephone speech; Limited enrolling data; Noise robustness; Speaker verification database in Spanish;
D O I
10.1016/j.specom.2007.11.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, an unsupervised intra-speaker variability compensation (ISVC) method based oil Gestalt is proposed to address the problem of limited enrolling data and noise robustness in text-dependent speaker verification (SV). Experiments with two databases show that: ISVC can lead to reductions in EER as high as 20% or 40% and ISCV provides reductions in the integral below the ROC curve between 30%, and 60%. Also, the observed improvements are independent of the number of enrolling utterances. In contrast to model adaptation methods, ISVC is memoryless with respect to previous verification attempts. As shown here, unsupervised model adaptation can lead to substantial improvements in EER but is highly dependent oil the sequence of client/impostor verification events. In adverse scenarios, such its massive impostor attacks and verification from alternated telephone line, unsupervised model adaptation might even provide reductions in verification accuracy when compared with the baseline system. In those cases, ISVC can even outperform adaptation schemes. It is worth emphasizing that ISVC and unsupervised model adaptation are compatible and the combination of both methods always improves the performance of model adaptation. The combination of both schemes can lead to improvements in EER its high its 34%. Due to the restrictions of commercially available databases for text-dependent SV research, the results presented here are based oil local databases in Spanish. By doing so, the visibility of research in Iberian Languages is highlighted. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:953 / 964
页数:12
相关论文
共 50 条
  • [1] On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification
    Garreton, Claudio
    Yoma, Nestor Becerra
    Huenupan, Fernando
    Molina, Carlos
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 913 - 916
  • [2] Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization
    Aronowitz, Hagai
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 138 - 145
  • [3] Intra-speaker variability compensation in speaker verification with limited enrolling data
    Garreton, Claudio
    Becerra Yoma, Nestor
    Molina, Carlos
    Huenupan, Fernando
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 509 - 512
  • [4] Intra-speaker variability effects on Speaker Verification performance
    Kahn, Juliette
    Audibert, Nicolas
    Rossato, Solange
    Bonastre, Jean-Francois
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 109 - 116
  • [5] INTRA-CONVERSATION INTRA-SPEAKER VARIABILITY COMPENSATION FOR SPEAKER CLUSTERING
    Wu, Kui
    Song, Yan
    Guo, Wu
    Dai, LiRong
    [J]. 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 330 - 334
  • [6] On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks
    Hansen, John H. L.
    Boril, Hynek
    [J]. SPEECH COMMUNICATION, 2018, 101 : 94 - 108
  • [7] Unsupervised model adaptation for speaker verification
    Preti, Alexandre
    Bonastre, Jean-Francois
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2090 - 2093
  • [8] INTRA-SPEAKER VARIABILITY OF THE LONG-TERM SPEECH SPECTRUM
    HARMEGNIES, B
    LANDERCY, A
    [J]. SPEECH COMMUNICATION, 1988, 7 (01) : 81 - 86
  • [9] On the sources of inter- & intra-speaker variability in the acoustic dynamics of speech
    Yang, X
    Millar, JB
    Macleod, I
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1792 - 1795
  • [10] Intra-speaker and inter-speaker variability in speech sound pressure level across repeated readings
    Castellana, Antonella
    Carullo, Alessio
    Astolfi, Arianna
    Puglisi, Giuseppina Emma
    Fugiglando, Umberto
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (04): : 2353 - 2363