Text-independent voice conversion based on state mapped codebook

Cited by: 0

Authors:

Zhang, Meng [1]

Tao, Jianhua [1]

Tian, Jilei [2]

Wang, Xia [3]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China

[2] Nokia Res Ctr, Interact Core Technol Ctr, Tampere, Finland

[3] Nokia Res Ctr, Tampere, Finland

Source:

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008

Funding:

National Natural Science Foundation of China;

Keywords:

text-independent; voice conversion; hidden Markov model; state mapped codebook;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Voice conversion has become increasingly important in speech technology, but most current approaches require parallel utterances from both the source and target speakers as the training corpus, which limits the application of the technology. In this paper, we propose a new method of text-independent voice conversion that uses a non-parallel corpus for training. A hidden Markov model (HMM) is used to represent the phonetic structure of the training speech and to generate training pairs of source and target speakers by mapping the HMM states between the source and target speech. HMM state mapped codebooks are then generated to create the mapping function for text-independent voice conversion. Subjective experiments based on ABX tests and MOS tests show that the proposed method achieves similar conversion performance and better speech quality compared with conventional voice conversion systems.
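
The abstract gives no algorithmic detail, so the following is only a toy sketch of the general idea of a state mapped codebook, not the authors' implementation. It assumes that every frame already carries an HMM state label and that source and target states with the same label have been mapped to each other (how the paper obtains that mapping is not shown here). The function names, the use of per-state mean vectors as codewords, and the nearest-codeword lookup are all illustrative assumptions.

```python
from collections import defaultdict
from math import dist


def state_centroids(frames, states):
    """Average the feature vectors that share the same HMM state label."""
    groups = defaultdict(list)
    for frame, state in zip(frames, states):
        groups[state].append(frame)
    return {
        state: [sum(col) / len(vecs) for col in zip(*vecs)]
        for state, vecs in groups.items()
    }


def build_state_mapped_codebook(src_frames, src_states, tgt_frames, tgt_states):
    """Pair source and target centroids of every state seen for both speakers.

    Assumption: identical state labels denote mapped states.
    """
    src = state_centroids(src_frames, src_states)
    tgt = state_centroids(tgt_frames, tgt_states)
    shared = sorted(set(src) & set(tgt))
    return [(src[s], tgt[s]) for s in shared]


def convert(frames, codebook):
    """Replace each frame with the target codeword paired with its nearest source codeword."""
    converted = []
    for frame in frames:
        _, tgt_vec = min(codebook, key=lambda pair: dist(frame, pair[0]))
        converted.append(tgt_vec)
    return converted


# Hypothetical 2-dimensional features; the two speakers say different things
# (non-parallel data) but share the state labels "a" and "b".
src_frames = [[0.0, 0.0], [0.2, 0.0], [1.0, 1.0]]
src_states = ["a", "a", "b"]
tgt_frames = [[2.0, 2.0], [3.0, 3.0], [3.0, 5.0]]
tgt_states = ["a", "b", "b"]

codebook = build_state_mapped_codebook(src_frames, src_states, tgt_frames, tgt_states)
converted = convert([[0.0, 0.1], [0.9, 1.2]], codebook)
```

Because the pairing is done per state rather than per time-aligned frame, the two training sets do not need to contain the same sentences, which is the property the abstract calls text-independent.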


Pages: 4605 / +

Page count: 2
