Text-independent voice conversion based on state mapped codebook

Cited by: 0
Authors
Zhang, Meng [1]
Tao, Jianhua [1]
Tian, Jilei [2]
Wang, Xia [3]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Nokia Res Ctr, Interact Core Technol Ctr, Tampere, Finland
[3] Nokia Res Ctr, Tampere, Finland
Funding
National Natural Science Foundation of China;
Keywords
text-independent; voice conversion; hidden Markov model; state mapped codebook;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Voice conversion has become increasingly important in speech technology, but most current work requires parallel utterances from both the source and target speakers as the training corpus, which limits the applicability of the technology. In this paper, we propose a new method of text-independent voice conversion that uses a non-parallel corpus for training. A hidden Markov model (HMM) is used to represent the phonetic structure of the training speech and to generate training pairs for the source and target speakers by mapping HMM states between the source and target speech. HMM state mapped codebooks are then generated to create the mapping function for text-independent voice conversion. Subjective experiments based on ABX and MOS tests show that the proposed method achieves similar conversion performance and better speech quality compared with conventional voice conversion systems.
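The abstract does not give implementation details, so the following is only a minimal sketch of what a state mapped codebook could look like, not the authors' method. It assumes each feature frame already carries an HMM state label (the HMM training and state mapping steps are not shown), that a codebook entry is the pair of per-state mean vectors, and that conversion is a hard nearest-entry lookup. All function names are hypothetical.

```python
from collections import defaultdict


def state_centroids(frames, states):
    """Average the feature vectors assigned to each HMM state label."""
    sums = {}
    counts = defaultdict(int)
    for vec, state in zip(frames, states):
        if state not in sums:
            sums[state] = [0.0] * len(vec)
        sums[state] = [a + b for a, b in zip(sums[state], vec)]
        counts[state] += 1
    return {s: [v / counts[s] for v in total] for s, total in sums.items()}


def build_state_mapped_codebook(src_frames, src_states, tgt_frames, tgt_states):
    """Pair source and target centroids that share a state label.

    Assumption: matching labels stand in for the source-to-target state
    mapping; the utterances themselves need not be parallel.
    """
    src = state_centroids(src_frames, src_states)
    tgt = state_centroids(tgt_frames, tgt_states)
    shared = sorted(set(src) & set(tgt))
    return [(src[s], tgt[s]) for s in shared]


def convert_frame(frame, codebook):
    """Replace a source frame with the target centroid paired with the
    nearest source centroid (squared Euclidean distance)."""
    def sqdist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))

    _, target = min(codebook, key=lambda pair: sqdist(frame, pair[0]))
    return target
```

A hard lookup like this produces piecewise-constant output; a practical system would more likely interpolate between several codebook entries, but the abstract does not specify which form the mapping function takes.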
Pages: 4605 / +
Number of pages: 2