GMM-Based Speaker Gender and Age Classification After Voice Conversion

被引：0

作者：

Pribil, Jiri ^{[1
]}

Pribilova, Anna ^{[2
]}

Matousek, Jindrich ^{[3
]}

机构：

[1] SAS, Inst Measurement Sci, Bratislava, Slovakia

[2] FEE&IT SUT, Inst Elect & Photon, Bratislava, Slovakia

[3] UWB, Fac Sci Appl, Dept Cybernet, Plzen, Czech Republic

来源：

2016 FIRST INTERNATIONAL WORKSHOP ON SENSING, PROCESSING AND LEARNING FOR INTELLIGENT MACHINES (SPLINE) | 2016年

关键词：

GMM classifier; speech features; speaker gender and age classification; text-to-speech system; voice tranformation; TO-SPEECH SYSTEM; IDENTIFICATION;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes an experiment using the Gaussian mixture models (GMM) for classification of the speaker gender/age and for evaluation of the achieved success in the voice conversion process. The main motivation of the work was to test whether this type of the classifier can be utilized as an alternative approach instead of the conventional listening test in the area of speech evaluation. The proposed two-level GMM classifier was first verified for detection of four age categories (child, young, adult, senior) as well as discrimination of gender for all but children's voices in Czech and Slovak languages. Then the classifier was applied for gender/age determination of the basic adult male/female original speech together with its conversion. The obtained resulting classification accuracy confirms usability of the proposed evaluation method and effectiveness of the performed voice conversions.

引用

页数：5

共 50 条

[1] GMM-based speaker age and gender classification in Czech and Slovak
Pribil, Jiri
Pribilova, Anna
Matousek, Jindrich
[J]. JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2017, 68 (01): : 3 - 12
[2] Evaluation of TTS Personification by GMM-Based Speaker Gender and Age Classifier
Pribil, Jiri
Pribilova, Anna
Matousek, Jindrich
[J]. TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 305 - 313
[3] Voice Conversion Using Bilinear Model Integrated with Joint GMM-based Classification
Sun, Xinjian
Zhang, Xiongwei
Yang, Jibin
Cao, Tieyong
[J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 1225 - 1228
[4] Speaker Dependent Approach for Enhancing a Glossectomy Patient's Speech via GMM-based Voice Conversion
Tanaka, Kei
Hara, Sunao
Abe, Masanobu
Sato, Masaaki
Minagi, Shogo
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3384 - 3388
[5] Speaker and session variability in GMM-based speaker verification
Kenny, Patrick
Boulianne, Gilles
Ouellet, Pierre
Dumouchel, Pierre
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
[6] Incorporating Global Variance in the Training Phase of GMM-based Voice Conversion
Hwang, Hsin-Te
Tsao, Yu
Wang, Hsin-Min
Wang, Yih-Ru
Chen, Sin-Horng
[J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
[7] Experimental Study on GMM-Based Speaker Recognition
Ye, Wenxing
Wu, Dapeng
Nucci, Antonio
[J]. MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
[8] Quantization for adapted GMM-based speaker verification
Tseng, Ivy H.
Verscheure, Olivier
Turaga, Deepak S.
Chaudhari, Upendra V.
[J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 653 - 656
[9] A GMM-Based Speaker Identification System on FPGA
Kan, Phak Len Eh
Allen, Tim
Quigley, Steven F.
[J]. RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
[10] FPGA Implementation for GMM-Based Speaker Identification
EhKan, Phaklen
Allen, Timothy
Quigley, Steven F.
[J]. INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2011, 2011

← 1 2 3 4 5 →