Voice Conversion Based on Hybrid SVR and GMM

被引:1
|
作者
Song, Peng [1 ]
Jin, Yun [2 ,3 ]
Zhao, Li [1 ]
Zou, Cairong [1 ]
机构
[1] Southeast Univ, Minist Educ, Key Lab Underwater Acoust Signal Proc, Nanjing 210096, Jiangsu, Peoples R China
[2] Xuzhou Normal Univ, Sch Phys & Elect Engn, Xuzhou 221116, Peoples R China
[3] Southeast Univ, Minist Educ, Key Lab Child Dev & Learning Sci, Nanjing 210096, Jiangsu, Peoples R China
关键词
voice conversion; support vector regression; Gaussian mixture model; F0; prediction; speaker-specific information;
D O I
10.2478/v10168-012-0020-9
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A novel VC (voice conversion) method based on hybrid SVR (support vector regression) and GMM (Gaussian mixture model) is presented in the paper, the mapping abilities of SVR and GMM are exploited to map the spectral features of the source speaker to those of target ones. A new strategy of F0 transformation is also presented, the F0s are modeled with spectral features in a joint GMM and predicted from the converted spectral features using the SVR method. Subjective and objective tests are carried out to evaluate the VC performance; experimental results show that the converted speech using the proposed method can obtain a better quality than that using the state-of-the-art GMM method. Meanwhile, a VC method based on non-parallel data is also proposed, the speaker-specific information is investigated using the SVR method and preliminary subjective experiments demonstrate that the proposed method is feasible when a parallel corpus is not available.
引用
收藏
页码:143 / 149
页数:7
相关论文
共 50 条
  • [31] A Time-scale Alternation Method in GMM Voice Conversion System
    Zhou Ying
    Zhang Ling-hua
    [J]. ELECTRONIC INFORMATION AND ELECTRICAL ENGINEERING, 2012, 19 : 161 - 164
  • [32] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
    CHEN Xian-tong
    ZHANG Ling-hua
    [J]. The Journal of China Universities of Posts and Telecommunications, 2014, (05) : 68 - 75
  • [33] High-quality voice conversion system based on GMM statistical parameters and RBF neural network
    CHEN Xian-tong
    ZHANG Ling-hua
    [J]. TheJournalofChinaUniversitiesofPostsandTelecommunications, 2014, 21 (05) : 68 - 75+93
  • [34] IMPROVED ESTIMATION OF FEMININITY USING GMM SUPERVECTORS AND SVR FOR VOICE THERAPY OF GENDER IDENTITY DISORDER CLIENTS
    Wang, Chengshuo
    Suzuki, Masayuki
    Minematsu, Nobuaki
    Sakuraba, Kyoko
    Hirose, Keikichi
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7751 - 7754
  • [35] Voice Conversion Method Combining Segmental GMM Mapping with Target Frame Selection
    Gu, Hung-Yan
    Tsai, Sung-Feng
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2015, 31 (02) : 609 - 626
  • [36] Speaker Dependent Approach for Enhancing a Glossectomy Patient's Speech via GMM-based Voice Conversion
    Tanaka, Kei
    Hara, Sunao
    Abe, Masanobu
    Sato, Masaaki
    Minagi, Shogo
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3384 - 3388
  • [37] Improving the Quality of Standard GMM-Based Voice Conversion Systems by Considering Physically Motivated Linear Transformations
    Zorila, Tudor-Catalin
    Erro, Daniel
    Hernaez, Inma
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, 2012, 328 : 30 - 39
  • [38] A Statistical Sample-Based Approach to GMM-Based Voice Conversion Using Tied-Covariance Acoustic Models
    Takamichi, Shinnosuke
    Toda, Tomoki
    Neubig, Graham
    Sakti, Sakriani
    Nakamura, Satoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2490 - 2498
  • [39] An SVR based hybrid modeling method
    Sun Z.
    Zhao Q.
    Zhao H.
    Feng W.
    Zhang W.
    Yang T.
    [J]. Zhao, Hongbo (bhzhb@buaa.edu.cn), 2017, Beijing University of Aeronautics and Astronautics (BUAA) (43): : 352 - 359
  • [40] Phoneme Independent Pathological Voice Detection Using Wavelet Based MFCCs, GMM-SVM Hybrid Classifier
    Vikram, C. M.
    Umarani, K.
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 929 - 934