Speaker age classification and regression using i-vectors

被引:31
|
作者
Grzybowska, Joanna [1 ]
Kacprzak, Stanislaw [1 ]
机构
[1] AGH Univ Sci & Technol, Krakow, Poland
关键词
speaker age recognition; regression; classification; computational paralinguistics; RECOGNITION;
D O I
10.21437/Interspeech.2016-1118
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we examine the use of i-vectors both for age regression as well as for age classification. Although i-vectors have been previously used for age regression task, we extend this approach by applying fusion of i-vectors and acoustic features regression to estimate the speaker age. By our fusion we obtain a relative improvement of 12.6% comparing to solely i-vector system. We also use i-vectors for age classification, which to our knowledge is the first attempt to do so. Our best results reach unweighted accuracy 62.9%, which is a relative improvement of 16.7% comparing to the best results obtained in age classification task at Age Sub-Challenge at Interspeech 2010.
引用
收藏
页码:1402 / 1406
页数:5
相关论文
共 50 条
  • [1] Speaker age estimation using i-vectors
    Bahari, Mohamad Hasan
    McLaren, Mitchell
    Hugo Van Hamme
    van Leeuwen, David A.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 34 : 99 - 108
  • [2] Migrating i-vectors Between Speaker Recognition Systems Using Regression Neural Networks
    Glembek, Ondrej
    Matejka, Pavel
    Plchot, Oldrich
    Pesan, Jan
    Burget, Lukas
    Schwarz, Petr
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2327 - 2331
  • [3] Native Accent Classification via I-Vectors and Speaker Compensation Fusion
    DeMarco, Andrea
    Cox, Stephen J.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1471 - 1475
  • [4] I-vectors for image classification
    Smith, David C.
    [J]. APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVII, 2014, 9217
  • [5] Senone I-Vectors for Robust Speaker Verification
    Tan, Zhili
    Zhu, Yingke
    Mak, Man-Wai
    Mak, Brian Kan-Wing
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [6] Robust Speaker Verification Using GFCC Based i-Vectors
    Jeevan, Medikonda
    Dhingra, Atul
    Hanmandlu, M.
    Panigrahi, B. K.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 1, 2017, 395 : 85 - 91
  • [7] Emotional Speaker Verification Based on I-vectors
    Mackova, Lenka
    Cizmar, Anton
    [J]. 2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 533 - 536
  • [8] SPEAKER AGE ESTIMATION ON CONVERSATIONAL TELEPHONE SPEECH USING SENONE POSTERIOR BASED I-VECTORS
    Sadjadi, Seyed Omid
    Ganapathy, Sriram
    Pelecanos, Jason W.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5040 - 5044
  • [9] Development of Speaker Recognizer Using I-vectors in Two Programming Environments
    Jakubec, Maros
    Lieskovska, Eva
    Jarina, Roman
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON NEW TRENDS IN SIGNAL PROCESSING (NTSP), 2020, : 34 - 38
  • [10] Speaker Verification using Sparse Representations on Total Variability I-Vectors
    Li, Ming
    Zhang, Xiang
    Yan, Yonghong
    Narayanan, Shrikanth
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2740 - +