GMM-Based Speaker Gender and Age Classification After Voice Conversion

Authors
Pribil, Jiri [1]
Pribilova, Anna [2]
Matousek, Jindrich [3]
Affiliations
[1] SAS, Inst Measurement Sci, Bratislava, Slovakia
[2] FEE&IT SUT, Inst Elect & Photon, Bratislava, Slovakia
[3] UWB, Fac Sci Appl, Dept Cybernet, Plzen, Czech Republic
Keywords
GMM classifier; speech features; speaker gender and age classification; text-to-speech system; voice transformation; TO-SPEECH SYSTEM; IDENTIFICATION
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes an experiment that uses Gaussian mixture models (GMM) to classify speaker gender and age and to evaluate how successful a voice conversion was. The main motivation was to test whether this type of classifier can serve as an alternative to the conventional listening test in speech evaluation. The proposed two-level GMM classifier was first verified on the detection of four age categories (child, young, adult, senior) and on gender discrimination for all voices except children's, in Czech and Slovak. The classifier was then applied to determine the gender and age of original adult male and female speech together with its converted versions. The resulting classification accuracy confirms the usability of the proposed evaluation method and the effectiveness of the performed voice conversions.
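
The two-level idea in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration, not the authors' implementation: the abstract does not say how the two levels are split, so the sketch assumes level 1 picks the age category and level 2 picks the gender within every non-child category. The 2-D features are synthetic, the names `DiagGMM`, `synth`, `train`, and `classify` are hypothetical, and the "conversion" is a crude feature shift standing in for a real voice conversion.

```python
import math
import random

def log_gauss(x, mean, var):
    """Log-density of a diagonal-covariance Gaussian at one frame x."""
    return -0.5 * sum(math.log(2 * math.pi * v) + (xi - m) ** 2 / v
                      for xi, m, v in zip(x, mean, var))

def logsumexp(vals):
    top = max(vals)
    return top + math.log(sum(math.exp(v - top) for v in vals))

class DiagGMM:
    """Tiny diagonal-covariance GMM trained with EM (illustrative only)."""
    def __init__(self, k=2, n_iter=20, seed=0):
        self.k, self.n_iter, self.seed = k, n_iter, seed

    def _frame_logs(self, x):
        return [math.log(w) + log_gauss(x, m, v)
                for w, m, v in zip(self.weights, self.means, self.vars)]

    def fit(self, frames):
        rng = random.Random(self.seed)
        n, dim = len(frames), len(frames[0])
        centre = [sum(f[d] for f in frames) / n for d in range(dim)]
        spread = [max(sum((f[d] - centre[d]) ** 2 for f in frames) / n, 1e-3)
                  for d in range(dim)]
        # Start from k random frames, the global variance, and equal weights.
        self.means = [list(f) for f in rng.sample(frames, self.k)]
        self.vars = [list(spread) for _ in range(self.k)]
        self.weights = [1.0 / self.k] * self.k
        for _ in range(self.n_iter):
            # E-step: responsibility of each component for each frame.
            resp = []
            for f in frames:
                logs = self._frame_logs(f)
                norm = logsumexp(logs)
                resp.append([math.exp(l - norm) for l in logs])
            # M-step: re-estimate weights, means, and floored variances.
            for j in range(self.k):
                nj = sum(r[j] for r in resp) + 1e-10
                self.weights[j] = nj / n
                self.means[j] = [sum(r[j] * f[d] for r, f in zip(resp, frames)) / nj
                                 for d in range(dim)]
                self.vars[j] = [max(sum(r[j] * (f[d] - self.means[j][d]) ** 2
                                        for r, f in zip(resp, frames)) / nj, 1e-3)
                                for d in range(dim)]
        return self

    def score(self, frames):
        """Average per-frame log-likelihood of an utterance."""
        return sum(logsumexp(self._frame_logs(f)) for f in frames) / len(frames)

# Synthetic stand-ins for speech features (assumed values, not real data).
AGE_CENTRE = {"child": 0.0, "young": 10.0, "adult": 20.0, "senior": 30.0}
GENDER_SHIFT = {"male": -3.0, "female": 3.0}

def synth(age, gender, n, rng):
    shift = GENDER_SHIFT.get(gender, 0.0)
    return [(rng.gauss(AGE_CENTRE[age], 1.0), rng.gauss(shift, 1.0))
            for _ in range(n)]

def train(rng, n=150):
    age_models, gender_models = {}, {}
    for age in AGE_CENTRE:
        if age == "child":  # no gender split for children's voices
            age_models[age] = DiagGMM().fit(synth(age, None, n, rng))
            continue
        by_gender = {g: synth(age, g, n, rng) for g in GENDER_SHIFT}
        age_models[age] = DiagGMM().fit(by_gender["male"] + by_gender["female"])
        gender_models[age] = {g: DiagGMM().fit(fr) for g, fr in by_gender.items()}
    return age_models, gender_models

def classify(frames, age_models, gender_models):
    # Level 1: age category with the highest average log-likelihood.
    age = max(age_models, key=lambda a: age_models[a].score(frames))
    if age not in gender_models:
        return age, None
    # Level 2: gender model within the chosen age category.
    genders = gender_models[age]
    return age, max(genders, key=lambda g: genders[g].score(frames))

rng = random.Random(1)
age_models, gender_models = train(rng)
original = synth("adult", "male", 40, rng)
converted = [(a, b + 6.0) for a, b in original]  # crude stand-in for conversion
print(classify(original, age_models, gender_models))
print(classify(converted, age_models, gender_models))
```

In this toy setting, a conversion would count as successful when the converted utterance is assigned to the target class rather than the source class, which is the kind of automatic check the abstract proposes as an alternative to a listening test.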
Pages: 5