Age and Gender Classification using Fusion of Acoustic and Prosodic Features

被引:0
|
作者
Meinedo, Hugo [1 ]
Trancoso, Isabel [1 ]
机构
[1] INESC ID Lisboa, Spoken Language Syst Lab L2F, Lisbon, Portugal
关键词
Paralinguistic Challenge; Age; Gender; Fusion of Acoustic and Prosodic Features; SPEECH;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a description of the INESC-ID Spoken Language Systems Laboratory (L2F) Age and Gender classification system submitted to the INTERSPEECH 2010 Paralinguistic Challenge. The L2F Age classification system and the Gender classification system are composed respectively by the fusion of four and six individual sub-systems trained with short and long term acoustic and prosodic features, different classification strategies (GMM-UBM, MLP and SVM) and using four different speech corpora. The best results obtained by the calibration and linear logistic regression fusion back-end show an absolute improvement of 4.1% on the unweighted accuracy value for the Age and 5.8% for the Gender when compared to the competition baseline systems in the development set.
引用
收藏
页码:2822 / 2825
页数:4
相关论文
共 50 条
  • [1] Automatic speaker age and gender recognition using acoustic and prosodic level information fusion
    Li, Ming
    Han, Kyu J.
    Narayanan, Shrikanth
    [J]. COMPUTER SPEECH AND LANGUAGE, 2013, 27 (01): : 151 - 167
  • [2] Fusion of Acoustic and Prosodic Features for Speaker Clustering
    Zibert, Janez
    Mihelic, France
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 210 - +
  • [3] Age and Gender Classification Using EEG Paralinguistic Features
    Phuoc Nguyen
    Dat Tran
    Huang, Xu
    Ma, Wanli
    [J]. 2013 6TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2013, : 1295 - 1298
  • [4] Using Prosodic and Acoustic Features for Chinese Dialects Identification
    Sun, Linjia
    [J]. PROCEEDINGS OF 2020 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION AND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND MACHINE LEARNING, IPMV 2020, 2020, : 118 - 123
  • [5] Robust Acoustic Event Classification using Fusion Fisher Vector features
    Mulimani, Manjunath
    Koolagudi, Shashidhar G.
    [J]. APPLIED ACOUSTICS, 2019, 155 : 130 - 138
  • [6] Acoustic Classification of Bird Species Using an Early Fusion of Deep Features
    Xie, Jie
    Zhu, Mingying
    [J]. BIRDS, 2023, 4 (01): : 138 - 147
  • [7] Deep Learning for Asphyxiated Infant Cry Classification Based on Acoustic Features and Weighted Prosodic Features
    Ji, Chunyan
    Xiao, Xueli
    Basodi, Sunitha
    Pan, Yi
    [J]. 2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 1233 - 1240
  • [8] Periocular Data Fusion for Age and Gender Classification
    Bisogni, Carmen
    Cascone, Lucia
    Narducci, Fabio
    [J]. JOURNAL OF IMAGING, 2022, 8 (11)
  • [9] Input Fusion of MFCC and SCMC Features for Acoustic Scene Classification using DNN
    Paseddula, Chandrasekhar
    Gangashetty, Suryakanth V.
    [J]. 2018 IEEE 13TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (IEEE ICIIS), 2018, : 26 - 30
  • [10] Emotion Classification in Children's Speech Using Fusion of Acoustic and Linguistic Features
    Polzehl, Tim
    Sundaram, Shiva
    Ketabdar, Hamed
    Wagner, Michael
    Metze, Florian
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 340 - +