Speech variability in automatic speaker recognition systems for commercial and forensic purposes

被引：0

作者：

Ortega-García, J ^{[1
]}

González-Rodríguez, J ^{[1
]}

Cruz-Llanas, S ^{[1
]}

机构：

[1] Univ Politecn Madrid, Dept Ingn Audiovisual & Comunicac, E-28031 Madrid, Spain

来源：

IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE | 2000年 / 15卷 / 11期

关键词：

D O I：

暂无

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Speaker Recognition is a major task when security applications through speech input are needed. Nevertheless, speech variability is a main degradation factor in speaker recognition tasks. Both intra-speaker and external variability sources produce mismatch between training and testing phases. In this contribution, channel and inter-session variability will be explored in order to accomplish real automatic systems for both commercial and forensic speaker recognition. Results will be presented making use of "AHUMADA," a subset of "GAUDI" large speaker recognition-oriented database in Spanish.

引用

页码：27 / 32

页数：6

共 50 条

[41] On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora
Soky, Kak
Li, Sheng
Mimura, Masato
Chu, Chenhui
Kawahara, Tatsuya
[J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 433 - 437
[42] Extracting accent information from Urdu speech for forensic speaker recognition
Tahir, Falak
Saleem, Sajid
Ahmad, Ayaz
[J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (05) : 3763 - 3778
[43] Impact of Emotional Speech to Automatic Speaker Recognition - Experiments on GEES Speech Database
Jokic, Ivan
Jokic, Stevan
Delic, Vlado
Peric, Zoran
[J]. SPEECH AND COMPUTER, 2014, 8773 : 268 - 275
[44] Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition
Sivasankaran, Sunit
Vincent, Emmanuel
Fohr, Dominique
[J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 346 - 350
[45] Methodologies for the evaluation of Speaker Diarization and Automatic Speech Recognition in the presence of overlapping speech
Galibert, Olivier
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1130 - 1133
[46] Analysis of Compressed Speech Signals in an Automatic Speaker Recognition System
Metzger, Richard A.
Doherty, John F.
Jenkins, David M.
[J]. 2015 49TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2015,
[47] SPEAKER-ADAPTABLE CLASSIFICATION PROCEDURE FOR AUTOMATIC SPEECH RECOGNITION
KATTERFELDT, H
THON, W
[J]. NACHRICHTENTECHNISCHE ZEITSCHRIFT, 1974, 27 (06): : 230 - 232
[48] DYNAMIC FREQUENCY WARPING FOR SPEAKER ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
PALIWAL, KK
AINSWORTH, WA
[J]. JOURNAL OF PHONETICS, 1985, 13 (02) : 123 - 134
[49] Null-Hypothesis LLR: A proposal for Forensic Automatic Speaker Recognition
Solewicz, Yosef A.
Jessen, Michael
van der Vloed, David
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2849 - 2853
[50] The Case for Automatic Higher-Level Features in Forensic Speaker Recognition
Shriberg, Elizabeth
Stolcke, Andreas
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1509 - 1512

← 1 2 3 4 5 →