Context Features Based Pre-Selection and Weight Prediction in Concatenation Speech Synthesis System

Times cited: 0
Authors
Liu, Shanfeng [1 ]
Wen, Zhengqi [1 ]
Li, Ya [1 ]
Tao, Jianghua [1 ]
Liu, Bin [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China
Keywords
concatenation speech synthesis; hierarchical pre-selection; weight prediction;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Generating natural-sounding synthesized speech remains a challenge for researchers in speech synthesis. Experiments show that speech concatenated from units selected from a large speech corpus performs better. However, limiting the search space and predicting the weights used in the target cost calculation remain important problems. This paper presents a detailed hierarchical pre-selection method to limit the search space. After three layers of pre-selection, a set of units is retained as the candidate units. To ensure continuity in duration, a prediction model is used in the hierarchical pre-selection. In addition, the M5P algorithm, which combines decision trees with regression, is used to predict the weights needed in the target cost calculation. Experimental results show that these two approaches can generate high-quality speech.
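The abstract's two ideas, layered pre-selection and a weighted target cost, can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration: the abstract does not say which context features each layer uses, so the `Unit` fields, the layer order, the `tolerance` value, and the function names `preselect` and `target_cost` are hypothetical. The weights are passed in as plain numbers; in the paper they are predicted by M5P, which this sketch does not implement.

```python
from dataclasses import dataclass


@dataclass
class Unit:
    # Hypothetical context features; the abstract does not list the real ones.
    phone: str       # phone identity
    tone: int        # lexical tone
    position: str    # position within the prosodic word
    duration: float  # duration in seconds


def preselect(candidates, target, predicted_duration, tolerance=0.05):
    """Narrow the candidate pool through three successive filters.

    A layer that would leave no candidates is skipped, so the pool
    never becomes empty unless it started empty.
    """
    layers = [
        # Layer 1 (assumed): same phone identity as the target.
        lambda u: u.phone == target.phone,
        # Layer 2 (assumed): same tone and prosodic position.
        lambda u: u.tone == target.tone and u.position == target.position,
        # Layer 3 (assumed): duration close to the predicted duration.
        lambda u: abs(u.duration - predicted_duration) <= tolerance,
    ]
    pool = list(candidates)
    for keep in layers:
        narrowed = [u for u in pool if keep(u)]
        if narrowed:
            pool = narrowed
    return pool


def target_cost(unit, target, weights):
    """Weighted sum of per-feature sub-costs between a unit and the target."""
    sub_costs = {
        "tone": float(unit.tone != target.tone),
        "position": float(unit.position != target.position),
        "duration": abs(unit.duration - target.duration),
    }
    return sum(weights[name] * cost for name, cost in sub_costs.items())
```

In a full system the surviving candidates would then be ranked by `target_cost` together with a concatenation cost; that search step is outside what the abstract describes and is omitted here.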
Pages: 506 - 510
Page count: 5
Related papers
50 in total
  • [41] A Novel Pavement Crack Detection Approach Using Pre-selection Based on Transfer Learning
    Zhang, Kaige
    Cheng, Hengda
    [J]. IMAGE AND GRAPHICS (ICIG 2017), PT I, 2017, 10666 : 273 - 283
  • [42] The Optimization of LRU algorithm based on pre-selection and cache prefetching of files in hybrid cloud
    Du, Shumeng
    Li, Chunlin
    Mao, Xijun
    Yan, Wei
    [J]. 2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 125 - 132
  • [43] SIMARD: A Simulated Annealing Based RNA Design Algorithm with Quality Pre-Selection Strategies
    Sav, Sinem
    Hampson, David J. D.
    Tsang, Herbert H.
    [J]. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [44] A Close Look into the Probabilistic Concatenation Model for Corpus-based Speech Synthesis
    Sakai, Shinsuke
    Maia, Ranniery
    Kawai, Hisashi
    Nakamura, Satoshi
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 744 - 747
  • [45] Complexity scalable motion estimation based on modes pre-selection in H.264
    Zhang, DM
    Huang, C
    Lin, SX
    Shen, YF
    Yu, LJ
    [J]. 2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 1183 - 1186
  • [46] Interested Sample Point Pre-Selection Based Dense Terrain Reconstruction for Autonomous Navigation
    Lin, Lili
    Zhou, Wenhui
    [J]. 2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 3, PROCEEDINGS, 2009, : 339 - +
  • [47] Vowel Onset Point based Waveform Concatenation Technique for Intelligible Speech Synthesis
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC), 2017, : 622 - 626
  • [48] Tree-based Context Clustering Using Speech Recognition Features for Acoustic Model Training of Speech Synthesis
    Chanjaradwichai, Supadaech
    Suchato, Atiwong
    Punyabukkana, Proadpran
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2015,
  • [49] Clustering-based return prediction model for stock pre-selection in portfolio optimization using PSO-CNN plus MVF
    Ashrafzadeh, Mahdi
    Taheri, Hasan Mehtari
    Gharehgozlou, Mahmoud
    Zolfani, Sarfaraz Hashemkhani
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [50] Supplier pre-selection for platform-based products: a multi-objective approach
    Cao, Yan
    Luo, Xinggang
    Kwong, C. K.
    Tang, Jiafu
    [J]. INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2014, 52 (01) : 1 - 19