A NOVEL UNIT SELECTION METHOD FOR CONCATENATION SPEECH SYSTEM USING SIMILARITY MEASURE

被引：0

作者：

Zhang, Ran ^{[1
]}

Tao, Jianhua ^{[1
]}

Li, Ya ^{[1
]}

Wen, Zhengqi ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China

来源：

2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE) | 2013年

关键词：

unit selection; hybird; speech synthesis; target cost;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

this paper presents a new approach to unit selection for corpus-based TTS system, in which the units are selected according to their similarity with synthetic target generated by a parametric synthesizer. In the training stage, a group of classifiers are trained based on human perceptual judgments. The outputs of the classifiers are used to make a distinction rather than using traditional methods such as continuously-valued cost. In order to obtain a better classification result, different combinations of features are tried as input vectors, and the similarity rating is carried out dexterously. Subjective listening tests on a Mandarin female TTS system show that the proposed classifier based speech synthesis system outperforms the traditional unit-selection system.

引用

页数：5

共 50 条

[1] MYANMAR SPEECH SYNTHESIS SYSTEM BY USING PHONEME CONCATENATION METHOD
Hlaing, Chaw Su
Thida, Aye
[J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSPC'17), 2017, : 399 - 404
[2] Accurate Visual Speech Synthesis Based on Diviseme Unit Selection and Concatenation
Jiang, Dongmei
Ravyse, Ilse
Sahli, Hichem
Zhang, Yanning
[J]. 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 910 - +
[3] Towards Automatic Measure of Similarity for Use in Unit Selection
Tihelka, Daniel
[J]. ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 637 - 642
[4] A novel prosody adaptation method for Mandarin concatenation-based text-to-speech system
Yu, Jian
Tao, Jianhua
[J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2009, 30 (01) : 33 - 41
[5] Indonesian Text-To-Speech System Using Syllable Concatenation: Speech Optimization
Mengko, Richard
Ayuningtyas, Aulia
[J]. PROCEEDINGS OF 2013 3RD INTERNATIONAL CONFERENCE ON INSTRUMENTATION, COMMUNICATIONS, INFORMATION TECHNOLOGY, AND BIOMEDICAL ENGINEERING (ICICI-BME), 2013, : 412 - 415
[6] A 'personalized' facial expression recognition with fuzzy similarity measure and novel feature selection method
Kim, DJ
Bien, Z
[J]. 2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 33 - 38
[7] A novel similarity measure for heuristic selection in examination timetabling
Yang, Y
Petrovic, S
[J]. PRACTICE AND THEORY OF AUTOMATED TIMETABLING V, 2005, 3616 : 247 - 269
[8] A NOVEL HYBRID MANDARIN SPEECH SYNTHESIS SYSTEM USING DIFFERENT BASE UNITS FOR MODEL TRAINING AND CONCATENATION
Zhang, Ran
Tao, Jianhua
Li, Ya
Wen, Zhengqi
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[9] Unit-Selection Speech Synthesis Method Using Words as Search Units
Segi, Hiroyuki
[J]. INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2016, 7 (02): : 53 - 67
[10] A Novel Similarity Measure Technique for Clustering Using Multiple Viewpoint Based Method
Potdar, Dushyant S.
Pattewar, Tareek M.
[J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO'16), 2016,

← 1 2 3 4 5 →