Speech variability in automatic speaker recognition systems for commercial and forensic purposes

被引:0
|
作者
Ortega-García, J [1 ]
González-Rodríguez, J [1 ]
Cruz-Llanas, S [1 ]
机构
[1] Univ Politecn Madrid, Dept Ingn Audiovisual & Comunicac, E-28031 Madrid, Spain
关键词
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Speaker Recognition is a major task when security applications through speech input are needed. Nevertheless, speech variability is a main degradation factor in speaker recognition tasks. Both intra-speaker and external variability sources produce mismatch between training and testing phases. In this contribution, channel and inter-session variability will be explored in order to accomplish real automatic systems for both commercial and forensic speaker recognition. Results will be presented making use of "AHUMADA," a subset of "GAUDI" large speaker recognition-oriented database in Spanish.
引用
收藏
页码:27 / 32
页数:6
相关论文
共 50 条
  • [31] SPEECH RECOGNITION SYSTEM WITH AUTOMATIC SPEAKER-ADAPTION
    BROUWER, P
    [J]. FREQUENZ, 1978, 32 (07) : 204 - 207
  • [32] An Automatic Speech Recognition Solution with Speaker Identification Support
    Buzo, Andi
    Cucu, Horia
    Petrica, Lucian
    Burileanu, Dragos
    Burileanu, Corneliu
    [J]. 2014 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2014,
  • [33] Improved automatic speech recognition through speaker normalization
    Giuliani, D
    Gerosa, M
    Brugnara, F
    [J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (01): : 107 - 123
  • [34] Statistical Evaluation of Biometric Evidence in Forensic Automatic Speaker Recognition
    Drygajlo, Andrzej
    [J]. COMPUTATIONAL FORENSICS, PROCEEDINGS, 2009, 5718 : 1 - 12
  • [35] Interchangeability of calibration audio datasets for forensic automatic speaker recognition
    van der Vloed, David
    [J]. 12TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS, IWBF 2024, 2024,
  • [36] Modeling inter-speaker variability in speech recognition
    Cloarec, Gwenael
    Jouvet, Denis
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4529 - 4532
  • [37] Capturing local variability for speaker normalization in speech recognition
    Miguel, Antonio
    Lleida, Eduardo
    Rose, Richard
    Buera, Luis
    Saz, Oscar
    Ortega, Alfonso
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 578 - 593
  • [38] Speaker clustering and transformation for speaker adaptation in speech recognition systems
    Padmanabhan, M
    Bahl, LR
    Nahamoo, D
    Picheny, MA
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 71 - 77
  • [39] Polyphone-IPSC: A shared speakers database for evaluation of forensic automatic speaker recognition systems
    Meuwly, D
    Alexander, A
    Drygajlo, A
    Botti, F
    [J]. FORENSIC SCIENCE INTERNATIONAL, 2003, 136 : 367 - 367
  • [40] On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora
    Soky, Kak
    Li, Sheng
    Mimura, Masato
    Chu, Chenhui
    Kawahara, Tatsuya
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 433 - 437