Context Features Based Pre-Selection and Weight Prediction in Concatenation Speech Synthesis System

Cited by: 0

Authors:

Liu, Shanfeng [1]

Wen, Zhengqi [1]

Li, Ya [1]

Tao, Jianghua [1]

Liu, Bin [1]

Affiliation:

[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100864, Peoples R China

Source:

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014

Keywords:

concatenation speech synthesis; hierarchical pre-selection; weight prediction

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Generating natural-sounding synthesized speech remains a challenge for researchers in speech synthesis. Experiments show that speech concatenated from units selected from a large speech corpus performs better. However, limiting the search space and predicting the weights used in the target cost calculation remain important problems. This paper presents a detailed hierarchical pre-selection method to limit the search space. After three layers of pre-selection, a set of units is selected as the candidate units. To ensure continuity in duration, a prediction model is used in the hierarchical pre-selection. In addition, the M5P algorithm, which combines decision trees with regression, is used to predict the weights needed in the target cost calculation. Experimental results show that these two approaches can generate high-quality speech.

Pages: 506-510

Page count: 5
