A NOVEL UNIT SELECTION METHOD FOR CONCATENATION SPEECH SYSTEM USING SIMILARITY MEASURE

Authors
Zhang, Ran [1]
Tao, Jianhua [1]
Li, Ya [1]
Wen, Zhengqi [1]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
Keywords
unit selection; hybrid; speech synthesis; target cost
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a new approach to unit selection for corpus-based TTS systems, in which units are selected according to their similarity to a synthetic target generated by a parametric synthesizer. In the training stage, a group of classifiers is trained on human perceptual judgments. The classifier outputs, rather than a traditional continuously valued cost, are used to distinguish between candidate units. To obtain better classification results, different combinations of features are tried as input vectors, and the similarity rating is carried out with care. Subjective listening tests on a female-voice Mandarin TTS system show that the proposed classifier-based speech synthesis system outperforms the traditional unit-selection system.
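The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature layout, the absolute-difference pair features, the hand-set linear classifier, and the rule that every classifier must agree are all assumptions made for illustration, and the names `pair_features`, `make_linear_classifier`, and `select_units` are hypothetical. In the paper the classifiers are trained on human perceptual judgments, which this sketch replaces with fixed weights.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]
Classifier = Callable[[Vector], bool]


def pair_features(target: Vector, candidate: Vector) -> List[float]:
    """Assumed pair representation: per-dimension absolute differences."""
    return [abs(t - c) for t, c in zip(target, candidate)]


def make_linear_classifier(weights: Vector, bias: float) -> Classifier:
    """Build a binary 'similar / not similar' decision function.

    Stand-in for a classifier trained on perceptual judgments;
    the weights here are hypothetical and set by hand.
    """
    def classify(features: Vector) -> bool:
        score = bias + sum(w * f for w, f in zip(weights, features))
        return score > 0.0
    return classify


def select_units(target: Vector,
                 candidates: Sequence[Vector],
                 classifiers: Sequence[Classifier]) -> List[int]:
    """Return indices of candidates that every classifier judges similar
    to the synthetic target (the all-agree rule is an assumption)."""
    selected = []
    for index, candidate in enumerate(candidates):
        features = pair_features(target, candidate)
        if all(clf(features) for clf in classifiers):
            selected.append(index)
    return selected


# Toy data: each vector is (duration in s, mean F0 in Hz), an assumed layout.
target = [1.0, 200.0]                      # from the parametric synthesizer
candidates = [[1.1, 205.0], [2.0, 150.0], [0.9, 198.0]]
classifiers = [make_linear_classifier([-1.0, -0.1], 1.0)]

selected = select_units(target, candidates, classifiers)
print(selected)
```

The decision is binary per candidate, which mirrors the abstract's contrast with a continuously valued target cost; a real system would still need a way to rank or join the surviving candidates.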
Pages: 5