TOWARDS SPEAKER AGE ESTIMATION WITH LABEL DISTRIBUTION LEARNING

被引:15
|
作者
Si, Shijing [1 ]
Wang, Jianzong [1 ]
Peng, Junqing [1 ]
Xiao, Jing [1 ]
机构
[1] Ping An Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
关键词
Speaker age estimation; Label distribution learning; Variance regularization; Attribute inference;
D O I
10.1109/ICASSP43922.2022.9746378
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Existing methods for speaker age estimation usually treat it as a multi-class classification or a regression problem. However, precise age identification remains a challenge due to label ambiguity, i.e., utterances from adjacent age of the same person are often indistinguishable. To address this, we utilize the ambiguous information among the age labels, convert each age label into a discrete label distribution and leverage the label distribution learning (LDL) method to fit the data. For each audio data sample, our method produces a age distribution of its speaker, and on top of the distribution we also perform two other tasks: age prediction and age uncertainty minimization. Therefore, our method naturally combines the age classification and regression approaches, which enhances the robustness of our method. We conduct experiments on the public NIST SRE08-10 dataset and a real-world dataset, which exhibit that our method outperforms baseline methods by a relatively large margin, yielding a 10% reduction in terms of mean absolute error (MAE) on a real-world dataset.
引用
收藏
页码:4618 / 4622
页数:5
相关论文
共 50 条
  • [1] SVLDL: IMPROVED SPEAKER AGE ESTIMATION USING SELECTIVE VARIANCE LABEL DISTRIBUTION LEARNING
    Kang, Zuheng
    Wang, Jianzong
    Peng, Junqing
    Xiao, Jing
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1037 - 1044
  • [2] Deep Label Distribution Learning for Apparent Age Estimation
    Yang, Xu
    Gao, Bin-Bin
    Xing, Chao
    Huo, Zeng-Wei
    Wei, Xiu-Shen
    Zhou, Ying
    Wu, Jianxin
    Geng, Xin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 344 - 350
  • [3] Facial Age Estimation by Adaptive Label Distribution Learning
    Geng, Xin
    Wang, Qin
    Xia, Yu
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4465 - 4470
  • [4] Learning with Ambiguous Label Distribution for Apparent Age Estimation
    Chen, Ke
    Kamarainen, Joni-Kristian
    [J]. COMPUTER VISION - ACCV 2016, PT III, 2017, 10113 : 330 - 343
  • [5] Age Estimation Using Expectation of Label Distribution Learning
    Gao, Bin-Bin
    Zhou, Hong-Yu
    Wu, Jianxin
    Geng, Xin
    [J]. PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 712 - 718
  • [6] Data-Dependent Label Distribution Learning for Age Estimation
    He, Zhouzhou
    Li, Xi
    Zhang, Zhongfei
    Wu, Fei
    Geng, Xin
    Zhang, Yaqing
    Yang, Ming-Hsuan
    Zhuang, Yueting
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) : 3846 - 3858
  • [7] Practical age estimation using deep label distribution learning
    Huiying ZHANG
    Yu ZHANG
    Xin GENG
    [J]. Frontiers of Computer Science., 2021, (03) - 47
  • [8] Practical age estimation using deep label distribution learning
    Zhang, Huiying
    Zhang, Yu
    Geng, Xin
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2021, 15 (03)
  • [9] Practical age estimation using deep label distribution learning
    Huiying Zhang
    Yu Zhang
    Xin Geng
    [J]. Frontiers of Computer Science, 2021, 15
  • [10] Semi-Supervised Adaptive Label Distribution Learning for Facial Age Estimation
    Hou, Peng
    Geng, Xin
    Huo, Zeng-Wei
    Lv, Jia-Qi
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2015 - 2021