Speech variability in automatic speaker recognition systems for commercial and forensic purposes

Cited by: 0
Authors
Ortega-García, J [1 ]
González-Rodríguez, J [1 ]
Cruz-Llanas, S [1 ]
Affiliations
[1] Univ Politecn Madrid, Dept Ingn Audiovisual & Comunicac, E-28031 Madrid, Spain
Keywords
DOI
Not available
Chinese Library Classification number
V [Aeronautics, Astronautics];
Discipline classification code
08 ; 0825 ;
Abstract
Speaker recognition is a major task when security applications based on speech input are needed. Nevertheless, speech variability is a main degradation factor in speaker recognition tasks. Both intra-speaker and external variability sources produce mismatch between the training and testing phases. In this contribution, channel and inter-session variability will be explored in order to accomplish real automatic systems for both commercial and forensic speaker recognition. Results will be presented making use of "AHUMADA," a subset of "GAUDI," a large speaker recognition-oriented database in Spanish.
Pages: 27 - 32
Number of pages: 6
Related papers
50 in total
  • [41] On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora
    Soky, Kak
    Li, Sheng
    Mimura, Masato
    Chu, Chenhui
    Kawahara, Tatsuya
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 433 - 437
  • [42] Extracting accent information from Urdu speech for forensic speaker recognition
    Tahir, Falak
    Saleem, Sajid
    Ahmad, Ayaz
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (05) : 3763 - 3778
  • [43] Impact of Emotional Speech to Automatic Speaker Recognition - Experiments on GEES Speech Database
    Jokic, Ivan
    Jokic, Stevan
    Delic, Vlado
    Peric, Zoran
    [J]. SPEECH AND COMPUTER, 2014, 8773 : 268 - 275
  • [44] Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition
    Sivasankaran, Sunit
    Vincent, Emmanuel
    Fohr, Dominique
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 346 - 350
  • [45] Methodologies for the evaluation of Speaker Diarization and Automatic Speech Recognition in the presence of overlapping speech
    Galibert, Olivier
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1130 - 1133
  • [46] Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System
    Metzger, Richard A.
    Doherty, John F.
    Jenkins, David M.
    [J]. 2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,
  • [47] SPEAKER-ADAPTABLE CLASSIFICATION PROCEDURE FOR AUTOMATIC SPEECH RECOGNITION
    KATTERFELDT, H
    THON, W
    [J]. NACHRICHTENTECHNISCHE ZEITSCHRIFT, 1974, 27 (06) : 230 - 232
  • [48] DYNAMIC FREQUENCY WARPING FOR SPEAKER ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
    PALIWAL, KK
    AINSWORTH, WA
    [J]. JOURNAL OF PHONETICS, 1985, 13 (02) : 123 - 134
  • [49] Null-Hypothesis LLR: A proposal for Forensic Automatic Speaker Recognition
    Solewicz, Yosef A.
    Jessen, Michael
    van der Vloed, David
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2849 - 2853
  • [50] The Case for Automatic Higher-Level Features in Forensic Speaker Recognition
    Shriberg, Elizabeth
    Stolcke, Andreas
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1509 - 1512