\ COMBINING REGRESSION AND CLASSIFICATION METHODS FOR IMPROVING AUTOMATIC SPEAKER AGE RECOGNITION

被引：21

作者：

van Heerden, Charl ^{[1
]}

Barnard, Etienne ^{[1
]}

Davel, Marelie ^{[1
]}

van der Walt, Christiaan ^{[1
]}

van Dyk, Ewald ^{[1
,2
]}

Feld, Michael ^{[2
]}

Mueller, Christian ^{[2
]}

机构：

[1] CSIR, Human Language Technol Meraka Inst, Pretoria, South Africa

[2] German Res Ctr AI, Intrlligent User Interface, Berlin, Germany

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Age classification; gender classification; support vector machines;

D O I：

10.1109/ICASSP.2010.5495006

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We present a novel approach to automatic speaker age classification, which combines regression and classification to achieve competitive classification accuracy on telephone speech. Support vector machine regression is used to generate finer age estimates, which are combined with the posterior probabilities of well-trained discriminative gender classifiers to predict both the age and gender of a speaker. We show that this combination performs better than direct 7-class classifiers. The regressors and classifiers are trained using long-term features such as pitch and formants, as well as short-term (frame-based) features derived from MAP adaptation of GMMs that were trained on MFCCs.

引用

页码：5174 / 5177

页数：4

共 50 条

[21] Automatic Speaker Recognition with Limited Data
Li, Ruirui
Jiang, Jyun-Yu
Liu, Jiahao
Hsieh, Chu-Cheng
Wang, Wei
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 340 - 348
[22] Voice Disguise in Automatic Speaker Recognition
Farrus, Mireia
ACM COMPUTING SURVEYS, 2018, 51 (04)
[23] SPEAKER NORMALIZATION FOR AUTOMATIC WORD RECOGNITION
BOEHM, JF
WRIGHT, RD
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (01): : 133 - &
[24] Subband architecture for automatic speaker recognition
Besacier, L
Bonastre, JF
SIGNAL PROCESSING, 2000, 80 (07) : 1245 - 1259
[25] Voice disguise and automatic speaker recognition
Zhang, Cuiling
Tan, Tiejun
FORENSIC SCIENCE INTERNATIONAL, 2008, 175 (2-3) : 118 - 122
[26] ADAPTING TO THE SPEAKER IN AUTOMATIC SPEECH RECOGNITION
TALBOT, M
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1987, 27 (04): : 449 - 457
[27] An overview of automatic speaker recognition technology
Reynolds, DA
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4072 - 4075
[28] Automatic speaker recognition of identical twins
Kuenzel, Hermann J.
INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2010, 17 (02) : 251 - 277
[29] A Survey on Automatic Speaker Recognition Systems
Saquib, Zia
Salam, Nirmala
Nair, Rekha P.
Pandey, Nipun
Joshi, Akanksha
SIGNAL PROCESSING AND MULTIMEDIA, 2010, 123 : 134 - 145
[30] Evaluations of automatic speaker classification systems
Martin, Alvin F.
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, 4343 LNAI : 313 - 329

← 1 2 3 4 5 →