Speech variability in automatic speaker recognition systems for commercial and forensic purposes

被引：0

作者：

Ortega-García, J ^{[1
]}

González-Rodríguez, J ^{[1
]}

Cruz-Llanas, S ^{[1
]}

机构：

[1] Univ Politecn Madrid, Dept Ingn Audiovisual & Comunicac, E-28031 Madrid, Spain

来源：

IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE | 2000年 / 15卷 / 11期

关键词：

D O I：

暂无

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Speaker Recognition is a major task when security applications through speech input are needed. Nevertheless, speech variability is a main degradation factor in speaker recognition tasks. Both intra-speaker and external variability sources produce mismatch between training and testing phases. In this contribution, channel and inter-session variability will be explored in order to accomplish real automatic systems for both commercial and forensic speaker recognition. Results will be presented making use of "AHUMADA," a subset of "GAUDI" large speaker recognition-oriented database in Spanish.

引用

页码：27 / 32

页数：6

共 50 条

[31] SPEECH RECOGNITION SYSTEM WITH AUTOMATIC SPEAKER-ADAPTION
BROUWER, P
[J]. FREQUENZ, 1978, 32 (07) : 204 - 207
[32] An Automatic Speech Recognition Solution with Speaker Identification Support
Buzo, Andi
Cucu, Horia
Petrica, Lucian
Burileanu, Dragos
Burileanu, Corneliu
[J]. 2014 10TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2014,
[33] Improved automatic speech recognition through speaker normalization
Giuliani, D
Gerosa, M
Brugnara, F
[J]. COMPUTER SPEECH AND LANGUAGE, 2006, 20 (01): : 107 - 123
[34] Statistical Evaluation of Biometric Evidence in Forensic Automatic Speaker Recognition
Drygajlo, Andrzej
[J]. COMPUTATIONAL FORENSICS, PROCEEDINGS, 2009, 5718 : 1 - 12
[35] Interchangeability of calibration audio datasets for forensic automatic speaker recognition
van der Vloed, David
[J]. 12TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS, IWBF 2024, 2024,
[36] Modeling inter-speaker variability in speech recognition
Cloarec, Gwenael
Jouvet, Denis
[J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4529 - 4532
[37] Capturing local variability for speaker normalization in speech recognition
Miguel, Antonio
Lleida, Eduardo
Rose, Richard
Buera, Luis
Saz, Oscar
Ortega, Alfonso
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 578 - 593
[38] Speaker clustering and transformation for speaker adaptation in speech recognition systems
Padmanabhan, M
Bahl, LR
Nahamoo, D
Picheny, MA
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (01): : 71 - 77
[39] Polyphone-IPSC: A shared speakers database for evaluation of forensic automatic speaker recognition systems
Meuwly, D
Alexander, A
Drygajlo, A
Botti, F
[J]. FORENSIC SCIENCE INTERNATIONAL, 2003, 136 : 367 - 367
[40] On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora
Soky, Kak
Li, Sheng
Mimura, Masato
Chu, Chenhui
Kawahara, Tatsuya
[J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 433 - 437

← 1 2 3 4 5 →