Towards Fine-Grained Prosody Control for Voice Conversion

Cited by: 2
Authors
Lian, Zheng [1, 3]
Zhong, Rongxiu [1, 3]
Wen, Zhengqi [1]
Liu, Bin [1]
Tao, Jianhua [1, 2, 3]
Affiliations
[1] CASIA, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
voice conversion (VC); phonetic posteriorgrams (PPGs); prosody embeddings; LPCNet vocoder; speech synthesis
DOI
10.1109/ISCSLP49672.2021.9362110
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In a typical voice conversion system, prior work uses various acoustic features of the source speech (such as pitch, the voiced/unvoiced flag, and aperiodicity) to control the prosody of the converted speech. However, prosody depends on many factors, such as intonation, stress, and rhythm, and it is challenging to describe it fully with handcrafted acoustic features. To address this difficulty, we propose to describe prosody with prosody embeddings, which are learned from the source speech in an unsupervised manner. To verify the effectiveness of the proposed method, we conduct experiments on our Mandarin corpus. Experimental results show that the proposed method improves the speech quality and speaker similarity of the converted speech. Moreover, we observe that the method achieves promising results even in singing conditions.
Pages: 5