Context Features Based Pre-Selection and Weight Prediction in Concatenation Speech Synthesis System

Authors
Liu, Shanfeng [1 ]
Wen, Zhengqi [1 ]
Li, Ya [1 ]
Tao, Jianghua [1 ]
Liu, Bin [1 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China
Keywords
concatenation speech synthesis; hierarchical pre-selection; weight prediction
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Generating natural-sounding synthesized speech remains a challenge for researchers in speech synthesis. Experiments show that speech concatenated from units selected from a large speech corpus performs better. However, limiting the search space and predicting the weights used in the target cost calculation remain important problems. This paper presents a detailed hierarchical pre-selection method to limit the search space. After three layers of pre-selection, a set of units is selected as the candidate units. To ensure continuity in duration, a prediction model is used in the hierarchical pre-selection. Meanwhile, the M5P algorithm, which combines decision trees with regression, is used to predict the weights needed in the target cost calculation. Experimental results show that these two approaches can generate high-quality speech.
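The abstract does not say which features each pre-selection layer uses, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes three hypothetical layers (phone identity, neighbouring-phone context, closeness to a predicted duration) followed by a weighted target cost. The `Unit` fields, the `tolerance` value, the fallback rule, and the sub-cost names are all illustrative assumptions, and the weights are simply passed in rather than predicted with M5P.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """A candidate speech unit; all fields are hypothetical context features."""
    phone: str
    left: str        # preceding phone
    right: str       # following phone
    duration: float  # seconds
    pitch: float     # Hz


def filter_or_keep(units, predicate):
    """Apply a filter, but keep the input set if the filter would empty it."""
    kept = [u for u in units if predicate(u)]
    return kept if kept else units


def preselect(corpus, target, predicted_duration, tolerance=0.03):
    """Narrow the corpus in three layers (assumed, not taken from the paper)."""
    # Layer 1: same phone identity (hard constraint).
    layer1 = [u for u in corpus if u.phone == target.phone]
    # Layer 2: same left and right phone context.
    layer2 = filter_or_keep(
        layer1, lambda u: u.left == target.left and u.right == target.right
    )
    # Layer 3: duration close to the value given by a duration prediction model.
    layer3 = filter_or_keep(
        layer2, lambda u: abs(u.duration - predicted_duration) <= tolerance
    )
    return layer3


def target_cost(unit, target, weights):
    """Weighted sum of sub-costs; in the paper the weights come from M5P."""
    sub_costs = {
        "duration": abs(unit.duration - target.duration),
        "pitch": abs(unit.pitch - target.pitch),
    }
    return sum(weights[name] * cost for name, cost in sub_costs.items())


def best_unit(corpus, target, predicted_duration, weights):
    """Return the lowest-cost pre-selected unit, or None if no phone matches."""
    candidates = preselect(corpus, target, predicted_duration)
    if not candidates:
        return None
    return min(candidates, key=lambda u: target_cost(u, target, weights))
```

Because each layer only shrinks the candidate set, the target cost is computed for far fewer units than a search over the full corpus would need. A full system would also add a concatenation (join) cost, which this sketch omits.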
Pages: 506-510
Page count: 5