\ COMBINING REGRESSION AND CLASSIFICATION METHODS FOR IMPROVING AUTOMATIC SPEAKER AGE RECOGNITION

被引:21
|
作者
van Heerden, Charl [1 ]
Barnard, Etienne [1 ]
Davel, Marelie [1 ]
van der Walt, Christiaan [1 ]
van Dyk, Ewald [1 ,2 ]
Feld, Michael [2 ]
Mueller, Christian [2 ]
机构
[1] CSIR, Human Language Technol Meraka Inst, Pretoria, South Africa
[2] German Res Ctr AI, Intrlligent User Interface, Berlin, Germany
关键词
Age classification; gender classification; support vector machines;
D O I
10.1109/ICASSP.2010.5495006
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a novel approach to automatic speaker age classification, which combines regression and classification to achieve competitive classification accuracy on telephone speech. Support vector machine regression is used to generate finer age estimates, which are combined with the posterior probabilities of well-trained discriminative gender classifiers to predict both the age and gender of a speaker. We show that this combination performs better than direct 7-class classifiers. The regressors and classifiers are trained using long-term features such as pitch and formants, as well as short-term (frame-based) features derived from MAP adaptation of GMMs that were trained on MFCCs.
引用
收藏
页码:5174 / 5177
页数:4
相关论文
共 50 条
  • [21] Automatic Speaker Recognition with Limited Data
    Li, Ruirui
    Jiang, Jyun-Yu
    Liu, Jiahao
    Hsieh, Chu-Cheng
    Wang, Wei
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 340 - 348
  • [22] Voice Disguise in Automatic Speaker Recognition
    Farrus, Mireia
    ACM COMPUTING SURVEYS, 2018, 51 (04)
  • [23] SPEAKER NORMALIZATION FOR AUTOMATIC WORD RECOGNITION
    BOEHM, JF
    WRIGHT, RD
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (01): : 133 - &
  • [24] Subband architecture for automatic speaker recognition
    Besacier, L
    Bonastre, JF
    SIGNAL PROCESSING, 2000, 80 (07) : 1245 - 1259
  • [25] Voice disguise and automatic speaker recognition
    Zhang, Cuiling
    Tan, Tiejun
    FORENSIC SCIENCE INTERNATIONAL, 2008, 175 (2-3) : 118 - 122
  • [26] ADAPTING TO THE SPEAKER IN AUTOMATIC SPEECH RECOGNITION
    TALBOT, M
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1987, 27 (04): : 449 - 457
  • [27] An overview of automatic speaker recognition technology
    Reynolds, DA
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 4072 - 4075
  • [28] Automatic speaker recognition of identical twins
    Kuenzel, Hermann J.
    INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2010, 17 (02) : 251 - 277
  • [29] A Survey on Automatic Speaker Recognition Systems
    Saquib, Zia
    Salam, Nirmala
    Nair, Rekha P.
    Pandey, Nipun
    Joshi, Akanksha
    SIGNAL PROCESSING AND MULTIMEDIA, 2010, 123 : 134 - 145
  • [30] Evaluations of automatic speaker classification systems
    Martin, Alvin F.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, 4343 LNAI : 313 - 329