GMM-based speaker age and gender classification in Czech and Slovak

被引:14
|
作者
Pribil, Jiri [1 ,2 ]
Pribilova, Anna [3 ]
Matousek, Jindrich [4 ]
机构
[1] Slovak Acad Sci, Inst Measurement Sci, Bratislava, Slovakia
[2] Univ West Bohemia, Fac Sci Appl, NTIS, Plze, Czech Republic
[3] Slovak Univ Technol Bratislava, Fac Elect Engn & Informat Technol, Ilkovicova 3, Bratislava 81219, Slovakia
[4] Univ West Bohemia, Fac Sci Appl, Dept Cybernet, NTIS, Plzen, Czech Republic
关键词
GMM classifier; spectral and prosodic features of speech; speaker gender and age classification; VOICE; RECOGNITION;
D O I
10.1515/jee-2017-0001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper describes an experiment with using the Gaussian mixture models (GMM) for automatic classification of the speaker age and gender. It analyses and compares the influence of different number of mixtures and different types of speech features used for GMM gender/age classification. Dependence of the computational complexity on the number of used mixtures is also analysed. Finally, the GMM classification accuracy is compared with the output of the conventional listening tests. The results of these objective and subjective evaluations are in correspondence.
引用
收藏
页码:3 / 12
页数:10
相关论文
共 50 条
  • [1] GMM-Based Speaker Gender and Age Classification After Voice Conversion
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. 2016 FIRST INTERNATIONAL WORKSHOP ON SENSING, PROCESSING AND LEARNING FOR INTELLIGENT MACHINES (SPLINE), 2016,
  • [2] Evaluation of TTS Personification by GMM-Based Speaker Gender and Age Classifier
    Pribil, Jiri
    Pribilova, Anna
    Matousek, Jindrich
    [J]. TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 305 - 313
  • [3] GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak
    Pribil, Jiri
    Pribilova, Anna
    [J]. COGNITIVE COMPUTATION, 2014, 6 (04) : 928 - 939
  • [4] GMM-Based Evaluation of Emotional Style Transformation in Czech and Slovak
    Jiří Přibil
    Anna Přibilová
    [J]. Cognitive Computation, 2014, 6 : 928 - 939
  • [5] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [6] Experimental Study on GMM-Based Speaker Recognition
    Ye, Wenxing
    Wu, Dapeng
    Nucci, Antonio
    [J]. MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
  • [7] Quantization for adapted GMM-based speaker verification
    Tseng, Ivy H.
    Verscheure, Olivier
    Turaga, Deepak S.
    Chaudhari, Upendra V.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 653 - 656
  • [8] A GMM-Based Speaker Identification System on FPGA
    Kan, Phak Len Eh
    Allen, Tim
    Quigley, Steven F.
    [J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
  • [9] FPGA Implementation for GMM-Based Speaker Identification
    EhKan, Phaklen
    Allen, Timothy
    Quigley, Steven F.
    [J]. INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2011, 2011
  • [10] GMM-based classification of genomic sequences
    Akhtar, Mahmood
    Ambikairajah, Eliathamby
    Epps, Julien
    [J]. PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 103 - +