Towards Fine-Grained Prosody Control for Voice Conversion

被引:2
|
作者
Lian, Zheng [1 ,3 ]
Zhong, Rongxiu [1 ,3 ]
Wen, Zhengqi [1 ]
Liu, Bin [1 ]
Tao, Jianhua [1 ,2 ,3 ]
机构
[1] CASIA, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
voice conversion (VC); phonetic posteriorgrams (PPGs); prosody embeddings; LPCNet vocoder; SPEECH SYNTHESIS;
D O I
10.1109/ISCSLP49672.2021.9362110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a typical voice conversion system, previous works utilized various acoustic features (such as the pitch, voiced/unvoiced flag and aperiodicity) of the source speech to control the prosody of converted speech. However, prosody is related with many factors, such as the intonation, stress and rhythm. It is a challenging task to perfectly describe prosody through handcrafted acoustic features. To address these difficulties, we propose to use prosody embeddings to describe prosody. These embeddings are learned from the source speech in an unsupervised manner. To verify the effectiveness of our proposed method, we conduct experiments on our Mandarin corpus. Experimental results show that our proposed method can improve the speech quality and speaker similarity of the converted speech. What's more, we observe that our method can even achieve promising results in singing conditions.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Learning fine-grained control for mapless navigation
    de Villiers, Fred
    Brink, Willie
    [J]. 2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 666 - 671
  • [42] Fine-grained access control for cloud computing
    Ye, Xinfeng
    Khoussainov, Bakh
    [J]. INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2013, 4 (2-3) : 160 - 168
  • [43] Towards Fault-Tolerant Fine-Grained Data Access Control for Smart Grid
    Jun Wu
    Mianxiong Dong
    Kaoru Ota
    Zhenyu Zhou
    Bin Duan
    [J]. Wireless Personal Communications, 2014, 75 : 1787 - 1808
  • [44] Towards Fine-Grained Online Adaptive Approximation Control for Dense SLAM on Embedded GPUs
    Bu, Tiancong
    Yan, Kaige
    Tan, Jingweijia
    [J]. ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (02)
  • [45] Towards Efficient, Secure, and Fine-Grained Access Control System in MSNs with Flexible Revocations
    Sun, Shi-Feng
    Lyu, Chen
    Gu, Dawu
    Zhang, Yuanyuan
    Ren, Yanli
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [46] Bloccess: Towards Fine-Grained Access Control Using Blockchain in a Distributed Untrustworthy Environment
    Ding, Yepeng
    Sato, Hiroyuki
    [J]. 2020 8TH IEEE INTERNATIONAL CONFERENCE ON MOBILE CLOUD COMPUTING, SERVICES, AND ENGINEERING (MOBILE CLOUD 2020), 2020, : 17 - 22
  • [47] Towards a Fine-Grained Access Control Mechanism for Privacy Protection and Policy Conflict Resolution
    Ha Xuan Son
    Chen, En
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (02) : 507 - 516
  • [48] Towards Fault-Tolerant Fine-Grained Data Access Control for Smart Grid
    Wu, Jun
    Dong, Mianxiong
    Ota, Kaoru
    Zhou, Zhenyu
    Duan, Bin
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2014, 75 (03) : 1787 - 1808
  • [49] Towards Fine-Grained Access Control in Enterprise-Scale Internet-of-Things
    Zhou, Qian
    Elbadry, Mohammed
    Ye, Fan
    Yang, Yuanyuan
    [J]. IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (08) : 2701 - 2714
  • [50] Transformation of Prosody in Voice Conversion
    Sisman, Berrak
    Li, Haizhou
    Tan, Kay Chen
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1588 - 1597